\ 



con ' -■■ ■ 



PCT 



WORLD 



\ I your Ref -FP-KC.- o'vv c 

Irtt*»i-p»oti/-vri"it T3itr A n.> * -•■ 




International Bureau 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



Internationa! Patent Classification 6 : 
C12N 9/28, CUD 3/386 



Al 



(11) International Publication Number: 
(43) International Publication Date: 



WO 99/23211 

14 May 199*9 (14.05.99) 



(21) International Application Number: PCT/DK 98/00471 

(22) International Filing Date: 30 October 1998 (30.10.98) 



(30) Priority Data: 

1 240/97 30 October 1 997 (30. 1 0.97) DK 

PA 1998 00936 14 July 1998 (14.07.98) DK 



(71) Applicant: NOVO NORDISK A/S [DK/DK]; Novo Alle, 

DK-2880 Bagsv<erd (DK). 

(72) Inventors: BORCHERT, Torben, Vedel; Novo Nordisk A/S, 

Novo Alle, DK-2880 Bagsvaerd (DK). SVENDSEN, Allan; 
Novo Nordisk A/S, Novo Alle, DK-2880 Bagsvaerd (DK). 
ANDERSEN, Carsten; Novo Nordisk A/S, Novo Alle, 
DK-2880 Bagsvasrd (DK). NIELSEN, Bjarne, R0nfeldt; 
Novo Nordisk A/S, Novo Alle, DK-2880 Bagsvaerd (DK). 
N1SSEN, Torben, Lauesgaard; Novo Nordisk A/S, Novo 
Alle, DK-2880 Bagsvaerd (DK). KJAERULFF, Soren; 
Novo Nordisk A/S, Novo Alle, DK-2880 Bagsvaerd (DK). 

(74) Agent: NOVO NORDISK A/S; Corporate Patents, Novo Alle, 
DK-2880 Bagsvaerd (DK). 



(81) Designated States: AL, AM, AT. AU. AZ, BA, BB f BG BR 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB,' GD* 
GE, GH, GM, HR, HU, ID, IL, IS, JP, KE, KG, KP, KR^ 
KZ, LC, LK, LR, LS, LT, LU. LV, MD. MG, MK, MN 
MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, 
SL, TJ, TM. TR, TT, UA, UG, UZ, VN, YU, ZW, ARIPO 
patent (GH, GM, KE. LS, MW, SD, SZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG. KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, H, FR, GB. GR, 
IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). ' 

Published 

With international search report. 



(54) Title: cr- AMYLASE MUTANTS 
(57) Abstract 



The invention relates to a variant of a parent Termamyl-like a-amylase, which exhibits an alteration in at least one of the followine 
properties relative to said parent a-amylase: i) improved pH stability at a pH from 8 to 10.5; and/or ii) improved Ca 2+ stability at pH 8 to 
10.5, and/or m) increased specific activity at temperatures from 10 to 60 °C 



8NSOOCID: <WO 992321 1A1J_> 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


AM 


Armenia 


AT 


Austria 


AU 


Australia 


AZ 


Azerbaijan 


BA 


Bosnia and Herzegovina 


BB 


Barbados 


BE 


Belgium 


BF 


Burkina Faso 


BG 


Bulgaria 


BJ 


Benin 


BR 


Brazil 


BY 


Belarus 


CA 


Canada 


CF 


Central African Republic 


CG 


Congo 


CH 


Switzerland 


CI 


Cote d'lvoire 


CM 


Cameroon 


CN 


China 


cu 


Cuba 


cz 


Czech Republic 


DE 


Germany 


DK 


Denmark 


EE 


Estonia 



ES 


Spain 


FI 


Finland 


FR 


France 


CA 


Gabon 


GB 


United Kingdom 


GE 


Georgia 


GH 


Ghana 


GN 


Guinea 


GR 


Greece 


HU 


Hungary 


JE 


Ireland 


1L 


Israel 


IS 


Iceland 


IT 


Italy 


JP 


Japan 


KE 


Kenya 


KG 


Kyrgyzstan 


KP 


Democratic People's 




Republic of Korea 


KR 


Republic of Korea 


KZ 


Kazakstan 


LC 


Saint Lucia 


U 


Liechtenstein 


LK 


Sri Lanka 


LR 


Liberia 



LS 


Lesotho 


LT 


Lithuania 


LU 


Luxembourg 


LV 


Latvia 


MC 


Monaco 


MD 


Republic of Moldova 


MG 


Madagascar 


MK 


The former Yugoslav 




Republic of Macedonia 


ML 


Mali 


MN 


Mongolia 


MR 


Mauritania 


MW 


Malawi 


MX 


Mexico 


NE 


Niger 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


PT 


Portugal 


RO 


Romania 


RU 


Russian Federation 


SD 


Sudan 


SE 


Sweden 


SG 


Singapore 



SI 


Slovenia 


SK 


Slovakia 


SN 


Senegal 


sz 


Swaziland 


TD 


Chad 


TG 


Togo 


TJ 


Tajikistan 


TM 


Turkmenistan 


TR 


Turkey 


TT 


Trinidad and Tobago 


UA 


Ukraine 


UG 


Uganda 


US 


United States of America 


uz 


Uzbekistan 


VN 


Viet Nam 


YU 


Yugoslavia 


ZW 


Zimbabwe 



NSOOCID: <WO 992321 1A1J_> 



WO 99/23211 



PCT/DK98/00471 



10 



a-amylase mutants 

FIELD OF THE INVENTION 

The present invention relates to variants (mutants) of parent 
5 Termamyl-like a-amylases with higher activity at medium 
temperatures and/or high pH. 

BACKGROUND OF THE INVENTION 

a-Amylases (a-1, 4-glucan-4 -glucanohydrolases, EC 3.2.1.1) 
constitute a group of enzymes which catalyze hydrolysis of 
starch and other linear and branched 1 , 4-glucosidic oligo- and 
polysaccharides . 

There is a very extensive body of patent and scientific 
literature relating to this industrially very important class of 
15 enzymes. A number of a-amylases such as Termamyl-like a- 
amylases variants are known from e.g. WO 90/11352, WO 95/1060.3, 
WO 95/26397, WO 96/23873 and WO 96/23874. 

Among more recent disclosures relating to a-amylases, WO 
96/238 74 provides three-dimensional, X-ray crystal structural 
20 data for a Termamyl-like a-amylase which consists of the 300 N- 
- terminal amino acid residues of the B. amyloliquefaciens 
a-amylase (BAN™) and amino acids 301-483 of the C-terminal end 
of the B. llcheniformis a-amylase comprising the amino acid 
sequence (the latter being available commercially under the 
25 tradename Termamyl™) , and which is thus closely related .to the 
industrially important Bacillus a-amylases (which in the present 
context are embraced within the meaning of the term "Termamyl- 
like a-amylases", and which include, inter alia, the B. 
licheniformis, B. amyloliquefaciens (BAN™) and B. 
stearothermophilus (BSG™ ) a-amylases) . WO 96/23874 further 
describes methodology for designing, on the basis of an analysis 
of the structure of a parent Termamyl-like a-amylase, variants 
of the parent Termamyl-like a-amylase which exhibit altered 
properties relative to the parent. 



30 



35 
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BRIEF DISCLOSURE OF THE INVENTION 

lue present invention relates to novel a-amylolytic variants 
(mutants) of a Termamyl-like a-amylase which exhibit improved 

wash performance (relative to the parent oc-amaylase) at high pH 
5 and at a medium temperature. 

. The term "medium temperature" means in the context of the 
invention a temperature from 10°C to 60°C, preferably 20°C to 
50°C, especially 30-40°C. 

The term "high pH" means the alkaline pH which is today used 
10 for washing, more specifically from about pH 8 to 10,5. 

In the context of the invention a "low temperature a- 
amylase" means an a-amylase which has a relative optimum 
activity in the temperature range from 0-30°C. 

In the context of the invention a "medium temperature a- 
15 amylase" means an a-amylase which has an optimum activity in the 
temperature range from 30-60°C. For instance, SP690 and SP722 ot- 
amylases, respectively, are "medium temperature a-amylases. 

In the context of the invention a "high temperature a- 
amylase" is an a-amylase having the optimum activity in the 
20 temperature range from 60-110°C. For instance, Termamyl is a 
"high temperature a-'amylase. 

Alterations in properties .which may be achieved in variants 
(mutants) of the invention are alterations in: 

The stability of the Termamyl-like a-amylase at a pH from 8 to 
25 10.5, and/or the Ca 2+ stability at pH 8 to 10.5, and/or the 

specific activity at temperatures from 10 to 60°C, preferably 20- 

50°C, especially 30-40°C. 

It should be noted that the relative temperature optimum is 

often dependent on the specific pH used. In other words the 
30 relative temperature optimum determined at, e.g. pH 8, may be 

substantially different from the relative temperature optimum 

determined at, e.g., pH 10. 

The temperature's influence on the enzymatic activity 
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The dynamics in the active site and surroundings are 
dependent on the temperature and the amino acid composition and 
of strong importance for the relative temperature optimum of an 
enzyme. By comparing the dynamics of medium and high temperature 
5 a-amylases, regions of importance for the function of high 
temperature a-amylases at medium temperatures can be determined. 
The temperature activity profile of the SP722 a-amaylase (SEQ ID 
NO: 2) and the B. licheniformis a-amylase (available from Novo 
Nordisk as Termamyl®) (SEQ ID NO: 4) are shown in Figure 2. 

10 The relative temperature optimum of SP722 in absolute 

activities is shown to be higher at medium range temperatures 
(30-60°C) than the homologous B. licheniformis a-amylase, which 
has an optimum activity around 60-100°C. The profiles are mainly 
dependent on the temperature stability and the dynamics of the 

15 active site residues and their surroundings. Further, the 
activity profiles are dependent on the pH used and the pKa of 
the active site residues. 

In the first aspect the invention relates to a variant of a 
parent Termamyl-like a-amylase, which variant has a-amylase 

20 activity, said variant comprises one or more mutations 
corresponding to the ^fol^wAng mutations in the amino acid 
isequence shown in SE@^ID NO: 2T"*\ 

T141, K142, F143, '^^^F T^—^T^S , G147, R148, G149, 

Q174, R181, G182, D183, G184, K185, A186, W189, S193, N195, 
25 H107, K108, G109,D166, W167, D168, Q169, S170, R171, Q172, F173, 

F267, W268, K269, N270, D271, L272, G273, A274, L275, K311, 

E346, K385, G456, N457, K458, P459, G460, T461, V462, T463. 

A variant of the invention have one or more of the following 

substitutions or deletions: 
30 T141A, D, R, N, C, E, Q, G, H, I,L,K,M, F, P, S, W, Y, V; 

K142A, D, R, N, C, E, Q, G, H, I , L, M, F, P, S, T, W, Y, V; 

F143A, D,R,N,C,E,Q,G,H, I,L, K,M, P, S, T, W, Y, V; 
[DT44P^R r ^ f C f E,Q f G f H f I f L f K f M f F, P, S, T, W, Y, V; 

F145A, D,R,N,C,E,Q,G,H, I , L, K, M, P,S,T,W, Y,V; 
35 P14 6A, D, R, N, C, E, Q, G, H, I,L,K,M, F, S,T,W, Y,V; 

G14 7A, D,R,N,C,E,Q,H, I,L,K,M, F, P,S,T,W, Y,V; 
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P4 5 9A,D,R,N,C,E,Q,G,H, I,L,K,M, F, S , T, W, Y, V; 
G4 60A,D,R,N,C,E,Q,H, I,L,K,M, F, P, S, T, W, Y, V; 
T461A,D / R,N,C,E,Q,G,H, I, L, K, M, F, P, S,W, Y, V; 
V4 62A, D,R,N,C,E,Q,G,H, I,L,K,M, F, P, S, T, W, Y; 
5 T4 63A, D,R,N,C,E,Q,G,H, I,L,K,M, F, P,S,W, Y,V. 

Preferred are variants having one or more of the following 
substitutions or deletions: 

K142R; S193P; N195F; K269R,Q; N270Y,R,D; K311R; E346Q; K385R; 
K458R; P459T; T461P; Q174*; R181Q,N,S; G182T,S,N; D183*; G184*; 
10 K185A, R, D,C,E,Q,G, H, I,L,M,N, F, P, S , T, W, Y, V; A18 6T, S , N, I , V, R; 
189T, S,N,Q. 

Especially preferred are variants having a deletion in 
positions D183 and G184 and further one or more of the following 
substitutions or deletions: 
15 K142R; S193P; N195F; K269R / Q; N270Y,R,D; K311R; E346Q; K385R; 
K458R; P459T; T461P; Q174*; R181Q,N,S; G182T,S,N; 
K185A,R,D,C,E,Q,G,H, I,L,M,N # F, P,S,T,W,Y,V; A18 6T, S, N, I , V, R; 
W189T, S,N,Q. 

The variants of the invention mentioned above exhibits an 
20 alteration in at least one of the following properties relative 
to the parent cx-amylase: 

i) improved pH stability at a pH from 8 to 10.5; and/or 

ii) improved Ca 2+ stability at pH 8 to 10.5, and/or 

iii) increased specific activity at temperatures from 10 to 60°C, 

25 preferably 20-50°C, especially 30-40°C. Further, details will be 

described below. 

The invention further relates to DNA constructs encoding 

variants of the invention; to methods for preparing variants of 

the invention; and to the use of variants of the invention, 
30 alone or in combination with other enzymes, in various 

industrial products or processes, e.g., in detergents or for 

starch liquefaction. 

In a final aspect the invention relates to a method of 

providing ct-amylases with altered pH optimum, and/or altered 
35 temperature optimum, and/or improved stability. 

Nomenclature 

In the present description and claims, the conventional one- 
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letter and three-letter codes for amino acid residues are used. 

For ease of reference, a-amylase variants of the invention are 
described by use of the following nomenclature: 
Original amino acid ( s ): position (s ): substituted amino acid(s) 
5 According to this nomenclature, for instance the substitution 

of alanine for asparagine in position 30 is shown as: 

Ala30Asn or A30N 

a deletion of alanine in the same position is shown as: 
Ala30* or A30* 
10 and insertion of an additional amino acid residue, such as 
lysine, is shown as: 

Ala30AlaLys or A30AK 

A deletion of a consecutive stretch of amino acid residues, 
such as amino acid residues 30-33, is indicated as (30-33)* or 
15 A(A30-N33) . 

Where a specific a-amylase contains a "deletion" in 
comparison with other a- amylases and an insertion is made in 
such a position this is indicated as: 

*36Asp or *36D 
20 for insertion of an aspartic acid in position 36 

Multiple mutations are separated by plus signs, i.e.: 

Ala30Asp + Glu34Ser or A30N+E34S 

representing mutations in positions 30 and 34 substituting 
alanine and glutamic acid for asparagine and serine, 
25 respectively. 

When one or more alternative amino acid residues may be 
inserted in a given position it is indicated as 

A30N,E or 

A30N or A30E 

30 Furthermore, when a position suitable for modification is 

identified herein without any specific modification being 
suggested, it is to be understood that any amino acid residue 
may be substituted for the amino acid residue present in the 
position. Thus, for instance, when a modification of an alanine 

35 in position 30 is mentioned, but not specified, it is to be 
understood that the alanine may be deleted or substituted for 
any other amino acid, i.e., any one of: 
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R,N, D, A,C,Q, E,G, H, I, L, K,M, F, P, S, T, W, Y, V. 

BRIEF DESCRIPTION OP THE DRAWINGS 

Figure 1 is an alignment of the amino acid sequences of six 
5 parent Termamyl-like cc-amylases. The numbers on the extreme left 
designate the respective amino acid sequences as follows: 
1: SEQ ID NO: 2 
2 : Kaoamyl 
3: SEQ ID NO: 1 
10 4 : SEQ ID NO: 5 
5: SEQ ID NO: 4 
6: SEQ ID NO: 3. 

Figure 2 shows the temperature activity profile of SP722 (SEQ 
ID NO: 2) (at pH 9) and B. licheniformis ct-amylase (SEQ ID NO: 
15 4) (at pH 7.3) . 

Figure 3 shows the temperature profile for SP690 (SEQ ID NO: 

1), SP722 (SEQ ID NO: 2), B. licheniformis ot-amylase (SEQ ID NO: 
4) at pH 10. 

Figure 4 is an alignment of the amino acid sequences of five 
a-amylases. The numbers on the extreme left designate the 
respective amino acid sequences as follows: 
1 : amyp_mouse 
2 : amyp_rat 

3: amyp_pig porcine pancreatic alpha-amylase (PPA) 
25 4 : amyp_human 

5: amy_altha A. haloplanctis alpha-amylase (AHA) 

DETAILED DISCLOSURE OF THE INVENTION 

The Termamyl-like a-amylase 

30 It: is well known that a number of a-amylases produced by 

Bacillus spp. are highly homologous on the amino acid level. For 
instance, the B. licheniformis a-amylase comprising the amino 
acid sequence shown in SEQ ID NO: . 4 (commercially available as 
Termamyl™) has been found to be about 89% homologous with the B. 

35 am ylolique faciens ^ a-amylase comprising the amino acid sequence 
shown in SEQ ID NO: 5 jand about 79% homologous with the B. 
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stearothermophilus a-amylase comprising the amino acid sequence 
shown in SEQ ID NO: 3. Further homologous a-amylases include an 
a-amylase derived from a strain of the Bacillus sp. NCIB 12289, 
NCIB 12512, NCIB 12513 or DSM 9375, all of which are described 
in detail in WO 95/26397, and the a-amylase described by 
Tsukamoto et al . , Biochemical and Biophysical Researc h 
Communications , 151 (1988), pp. 25-31, (see SEQ ID NO: 6). 

Still further homologous a-amylases include the a-amylase 
produced by the B . licheniformis strain described in EP 0252666 
(ATCC 27811) , and the a-amylases identified in WO 91/00353 and 
WO 94/18314. Other commercial Termamyl-like B . licheniformis 
a-amylases are comprised in the products Optitherm™ and 
Takatherm™ (available from Solvay) , Maxamyl™ (available from 
Gist-brocades/Genencor) , Spezym AA™ and Spezyme Delta AA™ 
(available from Genencor) , and Keistase™ (available from Daiwa) . 

Because of the substantial homology found between these a- 
amylases, they are considered to belong to the same class of a- 
amylases, namely the class of "Termamyl-like a-amylases". 

Accordingly, in the present context, the term "Termamyl-like 
20 a-amylase" is intended to indicate an a-amylase which, at the 
amino acid level, exhibits a substantial homology to Termamyl™, 
i.e., the B. licheniformis a-am vlase having the amino acid 
sequence shown in SEQ ID JNO:4 here in. In other words, all the 
following a-amylases which has the amino acid sequences shown in 
25 SEQ ID NOS: 1, 2, 3, 4, 5, 6, 7 or 8 herein, or the amino acid 
sequence shown in SEQ ID NO: 1 of WO 95/26397 (the same as the 
amino acid sequence shown as SEQ ID NO: 7 herein) or in SEQ ID 
NO: 2 of WO 95/26397 (the same as the amino acid sequence shown 
as SEQ ID NO: 8 herein) or in Tsukamoto et al., 1988, (which 
amino acid sequence is shown in SEQ ID NO: 6 herein) are 
considered to be "Termamyl-like a-amylase". Other Termamyl-like 
a-amylases are a-amylases i) which displays at least 60%, such 
as at least 70%, e.g., at least 75%, or at least 80%, e.g., at 
least 85%, at least 90% or at least 95% homology with at least 
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one of said amino acid sequences shown in SEQ ID NOS: 1-8 and/or 
ii) displays immunological cross-reactivity with an antibody 
raised against at least one of said a-amylases, and/or iii) is 
encoded by a DNA sequence which hybridizes to the DNA sequences 
encoding the above-specified a-amylases which are apparent from 
SEQ ID NOS: 9, 10, 11, or 12 of the present application (which 
encoding sequences encode the amino acid sequences shown in SEQ 
ID NOS: 1, 2, 3, 4 and 5 herein, respectively), from SEQ ID NO: 
4 of WO 95/26397 (which DNA sequence, together with the stop 
codon TAA, is shown in SEQ ID NO: 13 herein and encodes the 
amino acid sequence shown in SEQ ID NO: 8 herein) and from SEQ 
ID NO: 5 of WO 95/26397 (shown in SEQ ID NO: 14 herein), 
respectively. 

In connection with property i) , the "homology" may be 
determined by use of any conventional algorithm, preferably by 
use of the GAP progamme from the GCG package version 7.3 (June 
1993) using default values for GAP penalties, which is a GAP 
creation penalty of 3.0 and GAP extension penalty of 0.1, 
(Genetic Computer Group (1991) Programme Manual for the GCG 
Package, version 7, 575 Science Drive, Madison, Wisconsin, . USA 
53711) . 

A structural alignment between Termamyl (SEQ ID NO: 4) and a 
Termamyl-like a-amylase may be used to identify equiva- 
lent/corresponding positions in other Termamyl-like a-amylases. 
One method of obtaining said structural alignment is to use the 
Pile Up programme from the GCG package using default values of 
gap penalties, i.e., a gap creation penalty of 3.0 and gap 
extension penalty of 0.1. other, structural alignment methods 
include the hydrophobic cluster analysis (Gaboriaud et al . , 
30 (1987), FEBS LETTERS 224, pp. 149-155) and reverse threading 
(Huber, T ; Torda, AE, PROTEIN SCIENCE Vol. 7, No. 1 pp . 142- 
149 (1998) . 

Property ii) of the a-amylase, i.e., the immunological 
cross reactivity, may be assayed using an antibody raised 
35 against, or reactive with, at least one epitope of the relevant 
Termamyl-like a-amylase. The antibody, which may either be 
monoclonal or polyclonal, may be produced by methods known in 
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the art, e.g., as described by Hudson et al., Practical 
Immunology, Third edition (198*), Blackwell Scientific 
Publications. The immunological cross-reactivity may be determi- 
ned using assays known in the art, examples of which are Western 
5 Blotting or radial immunodiffusion assay, e.g., as described by 
Hudson et al., 1989. In this respect, immunological cross- 
reactivity between the oc-amylases having the amino acid 
sequences SEQ ID NOS : 1, 2, 3, 4, 5, 6, 7, or 8, respectively, 
has been found. 

10 The oligonucleotide probe used in the characterisation of the 

Termamyl-like a-amylase in accordance with property iii) above 
may suitably be prepared on the basis of the full or partial 
nucleotide or amino acid sequence of the a-amylase in question. 
Suitable conditions for testing hybridisation involve pre- 

15 soaking in 5xSSC and prehybridizing for 1 hour at ~~40°C in a 
solution of 20% formamide, . SxDenhardt's solution, 50mM sodium 
phosphate, pH 6.8, and 50mg of denatured sonicated calf thymus 
DNA, followed by hybridisation in the same solution supplemented 
with lOOmM ATP for 18 hours at -40°C, followed by three times 

20 washing of the filter in 2xSSC, 0.2% SDS at 40°C for 30 minutes 
(low stringency), preferred at 50°C (medium stringency), more 
preferably at 65°C (high stringency) , even more preferably at 
~75°C (very high stringency). More details about the 
hybridisation method can be found in Sambrook et al.. Molecular 

25 Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor, 1989. 

In the present context, "derived from" is intended not only 
to indicate an a-amylase produced or producible by a strain of 
the organism in question, but also an a-amylase encoded by a DNA 
sequence isolated from such strain and produced in a host or- 

30 ganism transformed with said DNA sequence. Finally, the term is 
intended to indicate an a-amylase which is encoded by a DNA 
sequence of synthetic and/or cDNA origin and which has the 
identifying characteristics of the a-amylase in question. The 
term is also intended to indicate that the parent a-amylase may 

35 be a variant of a naturally occurring a-amylase, i.e. a variant 
which is the result of a modification (insertion, substitution, 
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deletion) of one or more amino acid residues of the naturally 
occurring a-amylase. 

Parent hybrid a-amylases 

The parent a-amylase (i.e., backbone a-amylase) may be a 
hybrid a-amylase, i.e., an a-amylase which comprises a combi- 
nation of partial amino acid sequences derived from at least two 
a-amylases . 

The parent hybrid a-amylase may be one which on the basis of 
amino acid homology and/or immunological cross-reactivity and/or 
DNA hybridization (as defined above) can be determined to belong 
to the Termamyl-like a-amylase family, in this case, the hybrid 
a-amylase is typically composed of at least one part of a 
Termamyl-like a-amylase and part(s) of one or more other a- 
amylases selected from Termamyl-like a-amylases or non-Termamyl- 

like a-amylases of microbial (bacterial or fungal) and/or 
mammalian origin. 

Thus, the parent hybrid a-amylase may comprise a combination 
of partial amino acid sequences deriving from at least two 
Termamyl-like a-amylases, or from at least one Termamyl-like and 
at least one non-Termamyl-like bacterial a-amylase, or from at 
least one Termamyl-like and at least one fungal a-amylase. The 
Termamyl-like a-amylase from which a partial amino acid sequence 

derives may, e.g., be any of those specific Termamyl-like a- 
25 amylase referred to herein. 

For instance, the parent a-amylase may comprise a C-terminal 
part of an a-amylase derived from a strain of B. licheniformis, 
and a N-terminal part of an a-amylase derived from a strain of 
B. amyloliquefaciens or from a strain of B. stearothermophilus. 
For instance, the parent a-amylase may comprise at least 430 
amino acid residues of the C-terminal part of the B. 
licheniformis a-amylase, and may, e.g., comprise a) an amino 
acid segment corresponding to the 37 N-terminal amino acid 
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residues of the B. amyloliquefaciens a-amylase having the amino 
acid sequence shown in SEQ ID NO: 5 and an amino acid segment 
corresponding to the 445 Oterminal amino acid residues of the 
B. licheniformis a-amylase having the amino acid sequence shown 
5 in SEQ ID NO: 4, or 

a hybrid Termamyl-like a-amylase being identical to the 
Termamyl sequence, i.e., the Bacillus licheniformis a-amylase 
shown in SEQ ID NO: ^ jxcept that the N-terminal 35 amino acid 
residues (of the mature protein) has been replaced by the N- 

10 terminal 33 residues of BAN (mature protein), i.e., the 
Bacillus amyloliquefaciens a-amylase shown in SEQ ID NO: 5; or 
b) an amino acid segment corresponding to the 68 N-terminal 
amino acid residues of the B. stearothermophilus a- amylase 
having the amino acid sequence shown in SEQ ID NO: 3 and an 

15 f- *~amTno acicT segmentT"~£rorresponding to the 415 C-terminal amino 
ratfid residues of the B. licheni formis a-am ylase having the amino 
acid sequence shown in "SEQ ID NO: 4. 

Another s u i t a b 1 e^— pax eTnT™Ky lDFfcT" " a - anryi a s e is the one 
previously described in WO 96/23874 (from Novo Nordisk) 

20 constituting the N-terminus of BAN, Bacillus amyloliquefaciens 
a-amylase (amino acids 1-300 of the mature protein) and the C- 
terminus from Termamyl (amino acids 301-483 of the mature 
protein) . Increased activity was achieved by substituting one or 
more of the following positions of the above hybrid a-amylase 

25 (BAN: 1-300/Termamyl : 301-483) : Q360, F290, and N102. Particularly 
interesting substitutions are one or more of the following 
substitutions: Q360E, D; F290A,C, D,E,G,H, I,K,L,M,N, P,Q, R, S,T; 
N102D,E; 

The corresponding positions in the SP722 a-amylase shown in 
30 SEQ ID NO: 2 are one or more of: S365, Y295, N106. Corresponding 
substitutions of particular interest in said a-amylase shown in 
SEQ ID NO: 2 are one or more of: S365D,E; Y295 
A,C,D,E,G,H,I,K,L,M,N,P,Q,R,S,T; and N106D,E. 

The corresponding positions in the SP690 a-amylase shown in 
35 SEQ ID NO: 1 are one or more of: S365, Y295, N106. The 
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corresponding substitutions of particular interest are one or 
more of: S365D, E; Y295 A,C / D,E,G,H,I,K,L,M,N,P,Q,R,S,T; N106D,E. 

The above mentioned non-Termamyl-like ot-amylase may, e.g., be 
a fungal a-amylase, a mammalian or a plant a-amylase or a 
bacterial a-amylase (different from a Termamyl-like a-amylase) . 
Specific examples of such a-amylases include the Aspergillus 
oryzae TAKA a-amylase, the A. niger acid a-amylase, the Bacillus 
subtilis a-amylase, the porcine pancreatic a-amylase and a 
barley a-amylase. All of these a-amylases have elucidated 
structures which are markedly different from the structure of a 
typical Termamyl-like a-amylase as referred to herein. 

The fungal a-amylases mentioned above, i.e., derived from A. 
niger and A. oryzae, are highly homologous on the amino acid 
level and generally considered to belong to the same family of 
a-amylases. The fungal a-amylase derived from Aspergillus oryzae 
is commercially available under the tradename Fungamyl™. 

. Furthermore, when a particular variant of a Termamyl-like a- 
amylase (variant of the invention) is referred to - in a 
conventional manner - by reference to modification (e.g., 
deletion or substitution) of specific amino acid residues in the 
amino acid sequence of a specific Termamyl-like a-amylase, it is 
to be understood that variants of another Termamyl-like a- 
amylase modified in the equivalent position (s) (as determined 
from the best possible amino acid sequence alignment between the 
respective amino acid sequences) are encompassed thereby. 

In a preferred embodiment of the invention the a-amylase 
backbone is derived from B. licheniformis (as the parent 
Termamyl-like a-amylase), e.g., one of those referred to above, 
such as the B. licheniformis a-amylase having the amino acid 
sequence shown in SEQ ID NO: 4. 

Altered properties of variants of the invention 

The following discusses the relationship between mutations 
. which are present in variants of the invention, and desirable 
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alterations in properties (relative to those a parent Termamyl- 
like a-amylase) which may result therefrom. 

Improved stability at pH 8-10.5 
5 In the context of the present invention, mutations (including 

amino acid substitutions) of importance with respect to 
achieving improved stability at high pH (i.e., pH 8-10.5) 
include mutations corresponding to mutations in one or more of 
the following positions in SP722 a-amylase (having the amino 
10 acid sequence shown in SEQ ID NO: 2) : T141, K142, F143, D144, 
F145, P146, G147, 'R148, G149, R181, A186, S193, N195, K269, 
N270, K311, K458, P459, T461. 

The variant of the invention have one or more of the 
following substitutions (using the SEQ ID NO: 2 numbering): 



T141A, 


D, 


R, 


N, 


c, 


E, 


Q,G,H, 


I, 


L, 


K, 


M, F, 


P, 


s, 


W, Y, 


V; 


K142A, 


D, 


R, 


N, 


c, 


E, 


Q,G,H, 


I, 


L, 


M, 


F, P, 


s, 


T, 


W, Y, 


V; 


F143A, 


D, 


R, 


N, 


c, 


E, 


Qf G, H, 


I, 


L, 


K, 


M, P, 


s, 


T, 


W, Y, 


V; 


D144A, 


R/ 


N, 


c, 


E, 


Q, 


G,H, I, 


L, 


K, 


M, 


F, P, 


s, 


T, 


W, Y, 


V; 


F145A, 


D, 


R/ 


N, 


c, 


E, 


Q/ G, H, 


I, 


L, 


K, 


M, P, 


s, 


T, 


W, Y, 


V; 


P14 6A, 


D, 


R, 


N, 


C, 


E, 


Q,G,H, 


I, 


L, 


K, 


M, F, 


s, 


T, 


W, Y, 


V; 


G147A, 


D, 


R/ 


N, 


c, 


E, 


Q,H,I, 


L, 


K, 


Mr 


F,P, 


s, 


T, 


W, Y, 


V; 


R148A, 


D, 


N, 


c, 


E, 


Q, 


G,H,I, 


L, 


K, 


M, 


F,P, 


s, 


T, 


W, Y, 


V; 


G149A, 


D, 


R/ 


N, 


c, 


E, 


Q,H,I, 


L, 


K, 


M, 


F,P, 


s, 


T, 


W, Y, 


V; 


K181A, 


D, 


R, 


N, 


c, 


E, 


Q, G, H, 


I, 


L, 


M, 


F,P, 


s, 


T, 


W, Y, 


V; 


A186D, 


R, 


N, 


c, 


E, 


Q, 


G,H,I, 


L, 


P, 


K, 


M, F, 


s, 


T, 


W, Y, 


V; 


S193A, 


D, 


R, 




c, 


E, 


r Q/ G, H, 


I, 


L, 


K, 


M, F, 


p, 


T, 


,W, Y, 


. V; 


N195A, 


D, 


R. 


c, 


, E, 


Q, 


, G, I , 


L, 


K, 


M, 


F,P, 


s, 


T, 


,W,Y ( 


■ V; 


K269A, 


D, 


R< 




-C, 


E, 


t Q, G, H, 


I, 


,L, 


M, 


F,P, 


s, 


T, 


-W, Y, 


-V; 


N270A, 


D, 


R, 


c, 


-E, 


Q 


,G>H,I, 


L, 


,K, 


M, 


F,P, 


s, 


T, 


-W, Y ( 


-V; 


K311A, 


D, 


R, 


N 


rC, 


E 


r Q/ G, H, 


I, 


r L, 


- M, 


F, P, 


s, 


T, 


- W, Y, 


- V; 


K458A, 


D, 


- R« 


,N 


,c, 


E 


,Q,G,H, 


-I, 


-L ( 


M t 


F,P, 


s, 


T, 


rW, Y, 


-V; 


P459A, 


- D, 


r R/ 


,N 




- E 


r Qf G, H, 


- 1 1 


r L ( 


> K, 


, M, F, 


s, 


-T; 


-W, Y, 


r V; 


T461A, 


, D 


- R 


rN 




r E 


,Q,G,H, 


r I, 


, L, 


-K, 


, M, F, 


. p. 


- S 


,W, Y, 


rV. 



Preferred high pH stability variants include one or more of 
35 the following substitutions in the SP722 a-amylase (having the 
amino acid sequence shown in SEQ ID NO: 2) : 

K142R, R181S, A186T, S193P, N195F, K269R, N270Y, K311R, K458R, 
P459T and T461P. 
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In specific embodiment s the Bacillus strain NCIB 12512 a- 
amylase having the sequence shown in SEQ ID NO: 1, or the S. 
stearothermophilvs a-amylase having the sequence shown in SEQ ID 
NO: 3, or the B. licheniformis a-amylase having the sequence 
shown in SEQ ID NO: 4, or the B. amyloliquefaciens a-amylase 
having the sequence shown in SEQ ID NO: 5 is used as the 
backbone, i.e., parent Termamyl-like a-amylase, for these 
mutations. 

As can been seen from the alignment in Figure 1 the B. 
stearothermophilus a-amylase already has a Tyrosine at position 
corresponding to N270 in SP722. Further, the Bacillus strain 
NCIB 12512 a-amylase, the B. stearothermophilus a-amylase, the 
B. licheniformis a-amylase and the B. amyloliquefaciens a- 
amylase already have Arginine at position corresponding to K458 
in SP722. Furthermore, the B. licheniformis a-amylase already 
has a Proline at position corresponding to T461 in SP722. 

Therefore, for said a-amylases these substitutions are not 
relevant . 

. a-amylase variants with improved stability at high pH can be 
constructed by making substitutions in the regions found using 
the molecular dynamics simulation mentioned in Example 2. The 
simulation depicts the region (s) that has a higher flexibility 
or mobility at high pH (i.e., pH 8-10.5) when compared to medium 
pH. 

By using the structure of any bacterial alpha-amylase with 
homology (as defined below) to the Termamyl-like a-amylase 
(BA2), of which the 3D structure is disclosed in Appendix 1 of 
WO 96/23874 (from Novo Nordisk) , it is possible to modelbuild 
the structure of such alpha-amylase and to subject it to 
molecular dynamics simulations. The homology of said bacterial 
a-amylase may be at least 60%, preferably be more than 70%, more 
preferably more than 80%, most preferably more than 90% 
homologous to the above mentioned Termamyl-like a-amylase (BA2), 
measured using the UWGCG GAP program from the GCG package 
version 7.3 (June 1993) using default values for GAP penalties 
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[Genetic Computer Group (1991) Programme Manual for the GCG 
Package, version 7, 575 Science Drive, Madison, Wisconsin, USA 
53711]. Substitution of the unfavorable residue for another 
would be applicable. 

Improved Ca 2 * stability at pH 8-10.5 

Improved Ca 2+ stability means the stability of the enzyme 
under Ca 2+ depletion has been improved. In the context of the 
present invention, mutations {including amino acid 
substitutions) of importance with respect to achieving improved 
Ca 2+ stability at high pH include mutation or deletion in one or 
more positions corresponding to the following positions in the 
SP722 ct-amylase having the amino acid sequence shown in SEQ ID 
NO: 2: R181, G182, D183, G184, K185, A186, W189, N195, N270, 
E346, K385, K458, P459. 

A variant of the invention have one or more of the following 
substitutions or deletions: 

R181*,A, D,N,C,E,Q,G,H, I,L,K,M, F, P,S,T,W,Y,V; 
G182*,A, D,R,N,C,E,Q,H, I,L,K,M, F, P, S„ T, W, Y, V; 
D183*,A, R,N,C,E,Q,G,H, I,L,K,M, F, P, S, T, W, Y, V; 
G184*,A,R,D,N,C, E, Q,H, I,L,K,M, F, P,S,T,W, Y, V; 
K185A, D, R,N,C,E,Q,G,H, I,L,M, F, P,S,T,W, Y, V; 
A186D,R,N,C,E,Q,G,H, I,L,K,M, F, P,S,T,W,Y,V; 
W189A,D,R,N,C,E,Q,G,H, I,L,K,M, F,P,S,T, Y, V; 
N195A, D,R,C,E,Q,G,H, I,L,K,M, F, P, S, T, W, Y, V; 
N270A,R, D,N,C,E,Q,H, I,L,K,M, F, P, S, T, W, Y, V; 
E346A,R, D,N,C,Q,G,H, I,L,K,M, F, P , S , T , W , Y , V ; 
K385A,R, D,N,C,E,Q,G,H, I,L,M, F, P, S, T, W, Y, V; 
K458A,R, D,N,C,E,Q,G,H, I,L,M, F, P,S,T,W,Y,V; 
P459A,R, D,N,C,E,Q,G,H, I , L, K, M, F, S, T , W, Y f V . 

Preferred are variants having one or more of the following 
substitutions or deletions: 
R181Q,N; G182T,S,N; D183*; G184*; 

K185A,R, ^CE.QjCH, I,L,M,N, F, P,S,T,W, Y, V; A186T, S, N, I , V; 
W189T,S,N,Q; N195F, N270R, D; E346Q; K385R; K458R; P459T. 

In specific embodiments the Bacillus strain NCIB 12512 a- 
amylase having the sequence shown in SEQ ID NO: 1, or the B. 
amyloliquefaciens cc-amylase having the sequence shown in SEQ ID 
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NO: 5, or the B. licheniformis a-amylase having the sequence 
shown in SEQ ID NO: 4 are used as the backbone for these 
mutations . 

As can been seen from the alignment in Figure 1 the B. 
licheniformis a-amylase does not have the positions 
corresponding to D183 and G184 in SP722. Therefore for said a- 
amylases these deletions are not relevant. 

In a preferred embodiment the variant is the Bacillus strain 
NCIB 12512 a-amylase with deletions in D183 and G184 and further 
one • of the following substitutions: R181Q,N and/or G182T,S,N 
and/or D183*; G184* and/or 

K185A, R, D, C,E,Q,G,H, I , L, M, N, F, P, S , T, W, Y, V and/or A186T, S, N, I , V 
and/or W189T,S,N,Q and/or N195F and/or N270R,D and/or E346Q 
and/or K385R and/or K458R and/or P459T. 



Increased specific activity at medium temperature 

In a further aspect of the present invention, important 
mutations with respect to obtaining variants exhibiting 
~: increased specific activity at temperatures from 10-60°C, 
20 preferably 20-50°C, especially 30-40°C, include mutations 
corresponding to one or more of the following positions in the 
SP722 a-amylase having the amino acid sequence shown in SEQ ID 
NO: 2: 

H107, K108, G109, D166, W167, D168, Q169, S170, R171, Q172, 
25 F173, Q174, D183, G184, N195, F267, W268, K269, N270, D271, 
L272, G273, A274, L275, G456, N457, K458, P459, G460, T461, 
V462, T463. 

The variant of the invention have one or more of the 

following substitutions: 
30 H107A, D,R,N,C,E,Q ( G,I,L,K,M,F, P,S,T,W, Y,V; 

K108A, D,R,N,C,E,Q,G,H, I,L,M,F, P,S,T,W, Y,V; 

G109A, D,R,N,C,E,Q,H,I,L,K,M,F, P,S,T,W,Y,V; 

DieSA^^N^^^Q^^, I,L,K,M,F, P,S,T,W, Y,V; 

W167A, D,R,N,C,E,Q,G,H, I , L, K, M, F, P,S,T, Y,V; 
35 D168A, R,N,C,E,Q,G,H, I,L,K,M, F, P,S,T,W,Y,V; 

Q169A, D,R,N,C,E,G,H, I,L,K,M, F, P, S, T, W, Y, V; 
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S170A 
R171A 
Q172A 
F173A 
Q174* 
D183* 
G184* 
N195A 
F267A 
W268A 
K269A 
N270A 
D271A 
L272A 
G273A 
A274D 
L275A 
G456A 
N457A 
K458A 
P459A 
G460A 
T461A 
V4 62A 
T4 63A 
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M, F 
F, P 
F, P 
M, P 
M, F 
K, M 
M, F 
F, P 
M, P 
M, F 
F, P 
F, P 
F, P 
F, P 
F, P 
F, P 
F, P 
F, P 
F, P 
F, P 
M, F 
F, P 
M, F 
M, F 
M, F 
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P 
S 



,T,W, Y, V 
,T, W, Y, V 
S,T, W, Y,V 
S, T, W, Y, V 
P,S,T, W, Y, V 
F, P, S, W, Y, V 
P,S,T,W, Y, V 
S,T,W, Y,V 
S,T,W, Y,V 
P, S, T, Y, V 
S,T,W, Y, V 
S,T,W, Y,V 
S, T, W, Y, V 
S, T, W, Y, V, 
S,T, W, Y, V 
S, T, W, Y, V 
S,T,W, Y,V 
S,T,W, Y,V 
S,T,W, Y,V 
S,T,W, Y,V 
S, T, W, Y, V 
S,T,W, Y, V 
P,S,W, Y,V 
P,S,T,W, Y 
P,S, W, Y,V. 
one or more 



of the following 



30 



substitutions or deletions: Q174*, D183*, G184*, K269S. 

In a specific embodiment the B. licheniformis a-amylase 
having the sequence shown in SEQ ID NO: 4 is used as the 
backbone for these mutations. 



35 



General mutations in variants of the invention: increased 
specific activity at medium temperatures 

The particularly interesting amino acid substitution are 
those that increase the mobility around the active site of the 
enzyme. This is accomplished by changes that disrupt stabilizing 
interaction in the vicinity of the active site, i.e., within 
preferably 10A or 8A or 6A or 4A from any of the residues 
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constituting the active site. 

Examples are mutations that reduce the size of side chains, 
such as 
Ala to Gly, 
5 Val to Ala or Gly, 

lie or Leu to Val, Ala, or Gly 
Thr to Ser 

Such mutations are expected to cause increased flexibility in 
■the active site region either by the introduction of cavities 
10 or by the structural rearrangements that fill the space left by 
the mutation. 

It may be preferred that a variant of the invention comprises 
one or more modifications in addition to those outlined above. 
Thus, it may be advantageous that one or more Proline residues 

15 present in the part of the a-amylase variant which is modified 
is/are replaced with a non-Proline residue which may be any of 
the possible, naturally occurring non-Proline residues, and 
which preferably is an Alanine, Glycine, Serine, Threonine, 
Valine or Leucine. 

20 ^ Analogously, it may be preferred that one or more Cysteine 
residues present among the amino acid residues with which the 
parent a-amylase is modified is/are replaced with a non-Cysteine 
residue such as Serine, Alanine, Threonine, Glycine, Valine or 
Leucine . 

25 Furthermore, a variant of the invention may - either as the 

only modification or in combination with any of the above 
outlined modifications - be modified so that one or more Asp 
and/or Glu present in an amino acid fragment corresponding to 
the amino acid fragment 185-209 of SEQ ID NO: 4 is replaced by 

30 an Asn and/or Gin, respectively. Also of interest is the 
replacement, in the Termamyl-like a-amylase, of one or more of 
the Lys residues present in an amino acid fragment corresponding 
to the amino acid fragment 185-209 of SEQ ID NO: 4 by an Arg. 

It will be understood that the present invention encompasses 

35 variants incorporating two or more of the above outlined 
modifications . 

Furthermore, it may be advantageous to introduce point- 
mutations in any of the variants described herein. 
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a -amylase variants having increased mobility around the active 
site : 

The mobility of a-amylase variants of the invention may be 
5 increased by replacing one or more amino acid residue at one or 
more positions close to the substrate site. These positions are 
(using the SP722 a-amylase (SEQ ID NO: 2) numbering) : V56, K108, 
D168, Q169,.Q172, L201, K269, L272, L275, K446, P459. 

Therefore, in an aspect the invention relates to variants 
10 being mutated in one or more of the above mentioned positions. 

Preferred substitutions are one or more of the following: 
V56A,G,S,T; 

K108A, D,E,Q,G,H, I,L,M,N,S,T, V; 
D168A,G, I, V,N, S,T; 
15 Q169A, D, G, H, I, L,M,N, S,T, V; 
Q17 2A, D,G, H, I, L,M,N, S,T, V; 
L201A,G,I,V,S,T; 

K2 69A, D, E, Q, G, H, I,L,M,N,S,T,V; 

L272A, G, I,V,S,T; 
20 L275A,G,I,V,S,T; 

Y2 95A, D, E, Q, G, H, IjL/M/NfF/S/T/V; 

K4 4 6A, D, E, Q, G, H, I,L,M,N,S,T,V; 

P459A,G,I,L,S,T,V. 

In specific embodiments of the invention the Bacillus strain 
25 NCIB 12512 a-amylase having the sequence shown in SEQ ID NO: 1, 

or the B. stearothermophilus a-amylase having the sequence shown 

in SEQ ID NO: 3, or the B. licheniformis a-amylase having the 

sequence shown in SEQ ID NO: 4, or the B. amyloliquefaciens a- 

amylase having the sequence shown in SEQ ID NO: 5 are used as 
30 the backbone for these mutations. 

As can been seen from the alignment in Figure 1 the B. 

licheniformis a-amylase and the B. amyloliquefaciens a-amylase 

have a Glutamine at position corresponding to K269 in SP722. 

Further, the B. stearothermophilus a-amylase has a Serine at 
35 position corresponding to K269 in SP722. Therefore, for said a- 

amylases these substitutions are not relevant. 

Furthermore, as can been seen from the alignment in Figure 1 
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the B. amyloliquefaciens a-amylase has an Alanine at position 
corresponding to L272 in SP722, and the B. stearothermophilus cc- 
amylase has a Isoleucine at the position corresponding to L272 
in SP722. Therefore, for said a-amylases these substitutions are 
5 not relevant. 

: As can been seen from the alignment in Figure 1, the Bacillus 
strain 12512 a-amylase has a Isoleucine at position 
corresponding to L27 5 in SP722 . Therefore for said a-amylase 
this substitution is not relevant. 

10 As can been seen from the alignment in Figure 1 the B . 

amyloliquefaciens a-amylase has a Phenylalanine at position 
corresponding to Y295 in SP722. Further, the B. 
stearothermophilus a-amylase has an Asparagine at position 
corresponding to Y295 in SP722. Therefore, for said a-amylases 

15 these substitutions are not relevant. 

As can been seen from the alignment in Figure 1 the B . 
licheniformis a-amylase and the B. amyloliquefaciens a-amylase 
have a Asparagine at position corresponding to K446 in SP722. 
Further, the . B. stearothermophilus a-amylase has a Histidine at 

20 position corresponding to K446 in SP722. Therefore, for said a- 
amylases these substitutions are not relevant. 

As can been seen from the alignment in Figure 1 the B. 
licheniformis a-amylase, the B. amyloliquefaciens a-amylase and 
the B. stearothermophilus a-amylase have a Serine at position 

25 corresponding to P459 in SP722. Further, the Bacillus strain 
12512 a-amylase has a Threonine at position corresponding to 
P459 in SP722. Therefore, for said a-amylases these 
substitutions are not relevant. 

30 Stabilization of enzymes having high activity at medium 
temperatures 

In a further embodiment the invention relates to improving 
the stability of low temperature a-amylases (e.g, Alteromonas 
haloplanctis (Feller et al., (1994), Eur. J. Biochem 222:441- 
35 447), and medium temperature a-amylases (e.g., SP722 and SP690) 
possessing medium temperature activity, i.e., commonly known as 
psychrophilic enzymes and mesophilic enzymes. The stability can 
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for t: is particular enzyme class be unde .tood either as 
thermostability or the stability at Calcium depletion 
conditions. 

Typically, enzymes displaying the high activity at medium 
5 temperatures also display severe problems under conditions that 
stress the enzyme, such as temperature or Calcium depletion. 

Consequently, the objective is to provide enzymes that at the 
same time display the desired high activity at medium 
temperatures without loosing their activity under slightly 
10 stressed conditions. 

The activity of the stabilized variant measured at medium 
temperatures should preferably be between 100% or more and 50%, 
and more preferably between 100% or more and 70%, and most 
preferably between 100% or more and 85% of the original activity 
15 at that specific temperature before stabilization of the enzyme 
and the resulting enzyme should withstand longer incubation at 
stressed condition than the wild type enzyme. 

Contemplated enzymes include a-amylases of, e.g., bacterial 
or fungal origin. 

20 An example of such a low temerature a-amylase is the one 

isolated from Alteromonas haloplanctis (Feller et al., (1994), 
Eur. J. Biochem 222:441-447). The crystal structure of this 
alpha-amylase has been solved (Aghajari et al., (1998), Protein 
Science 7:564-572). 

25 The A. haloplanctis alpha-amylase (5 in alignment shown in 

Fig. 4) has a homology of approximately 66% to porcine 
pancreatic alpha-amylase (PPA) (3 in the alignment shown in Fig. 
4) . The PPA 3D structure is known, and can be obtained from 
Brookhaven database under the name 10SE or 1DHK. Based on the 

30 homology to other more stable alpha amylases, stabilization of 
"the low temperature highly active enzyme" from Alteromonas 
haloplanctis alpha-amylase, can be obtained and at the same time 
retaining the desired high activity at medium temperatures. 

Figure 4 shown a multiple sequence alignments of five a- 

35 amylases, including the AHA and the PPA a-amylase. Specific 
mutations giving increased stability in Alteromonas haloplantis 
alpha-amylase : 

T66P, Q69P, R155P, Q177R, A205P, A232P, L243R, V295P, S315R. 
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Methods for preparing a -amylase variants 

Several methods for introducing mutations into genes are 
known in the art. After a brief discussion of the cloning of oc- 
5 amylase-encoding DNA sequences, methods for generating mutations 
at specific sites within the a-amylase-encoding sequence will be 
discussed. 

Cloning a DNA sequence encoding an gc -amylase 

10 The DNA sequence encoding a parent a-amylase may be isolated 

from any cell or microorganism producing the a-amylase in 
question, using various methods well known in the art. First, a 
genomic DNA and/or cDNA library should be constructed using 
chromosomal DNA or messenger RNA from the organism that produces 

15 the a-amylase to be studied. Then, if the amino acid sequence of 
the a-amylase is known, homologous, labeled oligonucleotide 
probes may be synthesized and used to identify a-amylase- 
encoding clones from a genomic library prepared from the 
Qrganism in question. Alternatively, a labeled oligonucleotide 

20 probe containing sequences homologous to a known a-amylase gene 
could be used as a probe to identify a-amylase-encoding clones, 
using hybridization and washing conditions of lower stringency. 

Yet another method for identifying a-amylase-encoding clones 
would involve inserting fragments of genomic DNA into an ex- 

25 pression vector, such as a plasmid, transforming a-amylase- 
negative bacteria with the resulting genomic DNA library, and 
then plating the transformed bacteria onto agar containing a 
substrate for a-amylase, thereby allowing clones expressing the 
a-amylase to be identified. 

30 Alternatively, the DNA sequence encoding the enzyme may be 

prepared synthetically by established standard methods, e.g., 
the phosphoroamidite method described by S.L. Beaucage and M.H. 
Caruthers (1981) or the method described by Matthes et al. 
(1984) . In the phosphoroamidite method, oligonucleotides are 

35 synthesized, e.g., in an automatic DNA synthesizer, purified, 
annealed, ligated and cloned in appropriate vectors. 
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Finally, L..- DNA sequence may be of mixed genomic and synthe- 
tic origin, mixed synthetic and cDNA origin jr mixed genomic and 
cDNA origin, prepared by ligating fragments of synthetic, 
genomic or cDNA origin (as appropriate, the fragments 
corresponding, to various parts of the entire DNA sequence), in 
accordance with standard techniques. The DNA sequence may also 
be prepared by polymerase chain reaction (PCR) using specific 
primers, for instance as described in US 4,683,202 or R.K. Saiki 
et al. (1988) . 



Expression of a -amylase variants 

According to the invention, a DNA sequence encoding the 
variant produced by methods described above, or by any alterna- 
tive methods known in the art, can be expressed, in enzyme form, 
using an expression vector which typically includes control 
sequences encoding a promoter, operator, ribosome binding site, 
translation initiation signal, and, optionally, a repressor gene 
or various activator genes. 

The recombinant expression vector carrying the DNA sequence 
encoding an a-amylase variant of the invention may be any vector 
which may conveniently be subjected to recombinant DNA 
procedures, and the choice of vector will often depend on the 
host cell into which it is to be introduced. Thus, the vector 
may be an autonomously replicating vector, i.e., a vector which 
exists as an extrachromosomal entity, the replication of which 
is independent of chromosomal replication, e.g., a plasmid, a 
bacteriophage or an extrachromosomal element, minichromosome or 
an artificial chromosome. Alternatively, the vector may be one 
which, when introduced into a host cell, is integrated into the 
host cell genome and replicated together with the chromosome ( s ) 
into which it has been integrated. 

In the vector, the DNA sequence should be operably connected 
to a suitable promoter sequence. The promoter may be any DNA 
sequence which shows transcriptional activity in the host cell 
35 of choice and may be derived from genes encoding proteins either 
homologous or heterologous to the host cell. Examples of 
suitable promoters for directing the transcription of the DNA 
sequence encoding an a-amylase variant of the invention, 
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sequence which shows transcriptional activity in the host cell 
of choice and may be derived from genes encoding proteins either 
homologous or heterologous to the host cell. Examples of 
suitable promoters for directing the transcription of the DNA 

5 sequence encoding an a-amylase variant of the invention, 
especially in a bacterial host, are the promoter of the lac 
operon of E.coli, the Streptomyces coelicolor agarase gene dagh 
promoters, the promoters of the Bacillus licheniformls a-amylase 
gene (amyL) , the promoters of the Bacillus stearothermophilus 

10 maltogenic amylase gene (amyM) , the promoters of the Bacillus 
amyloliquefaciens a-amylase (amyQ) , the promoters of the Ba- 
cillus subtilis xylA and xylB genes etc. For transcription in a 
fungal host, examples of useful promoters are those derived from 
the gene encoding A . oryzae TAKA amylase, Rhizomucor miehei 

15 aspartic proteinase, A. niger neutral a-amylase, A. niger acid 
stable a-amylase, A. niger glucoamylase, Rhizomucor miehei 
lipase, A. oryzae alkaline protease, A. oryzae triose phosphate 
isomerase or A. nidulans acetamidase. 

The expression vector of the invention may also comprise a 

20 suitable transcription terminator and, in eukaryotes, poly- 
adienylation sequences operably connected to the DNA sequence 
encoding the a-amylase variant of the invention. Termination and 
polyadenylation sequences may suitably be derived from the same 
sources as the promoter. 

25 The vector may further comprise a DNA sequence enabling the 

vector to replicate in the host cell in question. Examples of 
such sequences are the origins of replication of plasmids pUC19, 
PACYC177, pUBllO, pE194, pAMBl and pIJ702. 

The vector may also comprise a selectable marker, e.g. a gene 

30 the product of which complements a defect in the host cell, such 
as the dal genes from B. subtilis or B. licheniformis, or one 
which confers antibiotic resistance such as ampicillin, 
kanamycin, chloramphenicol or tetracyclin resistance. Fur- 
thermore, the vector may comprise Aspergillus selection markers 

35 such as amdS, argB, niaD and sC, a marker giving rise to 
hygromycin resistance, or the selection may be accomplished by 
co-transformation, e.g., as described in WO 91/17243. 
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While intracellular expression may be advantageous in some 
respects, e.g., when using certain bacteria as host cells, it is 
generally preferred that the expression is extracellular. In 

general, the Bacillus oc-amylases mentioned herein comprise a 

5 preregion permitting secretion of the expressed protease into 
the culture medium. If desirable, this preregion may be replaced 
by a different preregion or signal sequence, conveniently accom- 
plished by substitution of the DNA sequences encoding the 
respective prefegions . 

10 The procedures used to ligate the DNA construct of the inven- 

tion encoding an a-amylase variant, the promoter, terminator and 
other elements, respectively, and to insert them into suitable 
vectors containing the information necessary for replication, 
are well known to persons skilled in the art (cf., for instance, 

15 Sambrook et al., Molecular Cloning: A Laboratory Manual , 2nd 
Ed., Cold Spring Harbor, 1989). 

The cell of the invention, either comprising a DNA construct 
or an expression vector of the invention as defined above, is 
advantageously used as a host cell in the recombinant production 

20 of an a-amylase variant of the invention. The cell may be 
transformed with the DNA construct of the invention encoding the 
variant, conveniently by integrating the DNA construct (in one 
or more copies) in the host chromosome. This integration is 
generally considered to be an advantage as the DNA sequence is 

25 more likely to be stably maintained in the cell. Integration of 
the DNA constructs into the host chromosome may be performed 
according to conventional methods, e.g., by homologous or 
heterologous recombination. Alternatively, the cell may be 
transformed with an expression vector as described above in 

30 connection with the different types of host cells. 

The cell of the invention may be a cell of a higher organism 
such as a mammal or an insect, but is preferably a microbial 
cell, e.g. a bacterial or a fungal (including yeast) cell. 

Examples of suitable bacteria are. Gram positive bacteria such 

35 as Bacillus subtilis, Bacillus licheniformis, Bacillus lentus, 
Bacillus brevis r Bacillus stearothermophilus , Bacillus alkalo- 
philus, Bacillus amyloliquefaciens, Bacillus coagulans, Bacillus 
circulans, Bacillus lautus, Bacillus megaterium, Bacillus 
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thuringiensis, or Streptomyces lividans or Streptomyces murinus, 
or gramnegative bacteria such as E.coli. The transformation of 
the bacteria may, for instance, be effected by protoplast trans- 
formation or by using competent cells in a manner known per se. 
5 The yeast organism may favorably be selected from a species 

of Saccharomyces or Schizosaccharomyces , e.g. Saccharomyces 
cerevisiae. The filamentous fungus may advantageously belong to 
a species of Aspergillus, e.g., Aspergillus oryzae or Aspergil- 
lus niger. Fungal cells may be transformed by a process involv- 
10 ing protoplast formation and transformation of the protoplasts 
followed by regeneration of the cell wall in a manner known per 
se. A suitable procedure for transformation of Aspergillus host 
cells is described in EP 238 023. 

In a yet further aspect, the present invention relates to a 
method of producing an cc-amylase variant of the invention, which 
method comprises cultivating a host cell as described above 
under conditions conducive to the production of the variant and 
recovering the variant from the cells and/or culture medium. 

.. The medium used to cultivate the cells may be any convention- 
al medium suitable for growing the host cell in question and 
obtaining expression of the a-amylase variant of the invention. 
Suitable media are available from commercial suppliers or may be 
prepared according to published recipes (e.g. as described in 
catalogues of the American Type Culture Collection) . 

The a-amylase variant secreted from the host cells may con- 
veniently be recovered from the culture medium by well-known 
procedures, including separating the cells from the medium by 
centrifugation or filtration, and precipitating proteinaceous 
components of the medium by means of a salt such as ammonium 
sulphate, followed by the use of chromatographic procedures such 
as ion exchange chromatography, affinity chromatography, or the 
like . 



20 
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Industrial Applications 

The a-amylase variants of this invention possesses valuable 
properties allowing for a variety of industrial applications. In 
particular, enzyme variants of the invention are applicable as a 
component in washing, dishwashing and hard-surface cleaning 
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detergent compositions. 

Numerous variants are particularly useful in the production 
of sweeteners and ethanol from starch, and/or for textile 
desizing. Conditions for conventional starch- conversion 
processes, including starch liquefaction and/or saccharif ication 
processes, are described in, e.g., US 3,912,590 and in EP patent 
publications Nos. 252,730 and 63,909. 

Detergent compositions 

As mentioned above, variants of the invention may suitably be 
incorporated in detergent compositions. Reference is made, for 
example, to WO 96/23874 and WO 97/07202 for further details 
concerning relevant ingredients of detergent compositions (such 
as laundry or dishwashing detergents), appropriate methods of 
formulating the variants in such detergent compositions, and for 
examples of relevant types of detergent compositions. 

Detergent compositions comprising a variant of the invention 
may additionally comprise one or more other enzymes, such as a 
lipase, cutinase, protease, cellulase, peroxidase or laccase, 
20 and/or another a-amylase. 

a-amylase variants of the invention may be incorporated in 
detergents at conventionally employed concentrations. It is at 
present contemplated that a variant of the invention may be 
incorporated in an amount corresponding to 0.00001-1 mg 
(calculated as pure, active enzyme protein) of a-amylase per 
liter of wash/dishwash liquor using conventional dosing levels 
of detergent. 

The invention also relates to a method of providing a- 
amylases with 1) altered pH optimum, and/or 2) altered 
temperature optimum, and/or 3) improved stability, comprising 
the following steps : 

i) identifying (a) target position (s) and/or region (s) for 
mutation of the a-amylase by comparing the molecular dynamics of 
two or more a-amylase 3D structures having substantially 
different pH, temperature and/or stability profiles, 

ii) substituting, adding and/or deleting one or more amino acids 
in the identified position(s) and/or region(s). 
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In embodiment of the invention a medium temperature a- 
amylase is compared with a high temperature a-amylase. In 
another embodiment a low temperature a-amylase is compared with 
either a medium or a high temperature a-amylase. 

The a-amylases compared should preferably be at least 70%, 
preferably 80%, up to 90%, such as up to 95%, especially 95% 
homologous with each other. 

The a-amylases compared may be Termamyl-like a-amylases as 
defined above. In specific embodiment the a-amylases compared 
are the a-amylases shown in SEQ ID NO: 1 to SEQ ID NO: 8. 

In another embodiment the stability profile of the a-amy- 
lases in question compared are the Ca 2+ dependency profile. 

MATERIALS AND METHODS 

15 Enzymes : 

SP722: (SEQ ID NO: 2, available from Novo Nordisk) 
Termamyl™ (SEQ ID NO: 4, available from Novo Nordisk) 
SP690: (SEQ ID NO: 1, available from Novo Nordisk) 

20 Bacillus subtilis SHA273: see WO 95/10603 

Plasmids 

pJEl contains the gene encoding a variant of SP722 a-amylase 
(SEQ ID NO: 2): viz. deletion of 6 nucleotides corresponding to 
amino acids D183-G184 in the mature protein. Transcription of 
the JE1 gene is directed from the amyL promoter. The plasmid 
further more contains the origin of replication and cat-gene 
conferring resistance towards kanamycin obtained from plasmid 
pUBHO (Gryczan, TJ et al. (1978), J. Bact. 134:318-329). 
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Methods : 

Construction of library vector pDorKlOl 

The E. coli/Bacillus shuttle vector pDorKlOl (described 
below) can be used to introduce mutations without expression of 
a-amylase in E. coli and then be modified in such way that the 
a-amylase is active in Bacillus. The vector was constructed as 
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follows: The JE1 encoding gene (SP722 with the deletion of 
D183-G184) was inactivated in pJEl by gene interruption in the 
PstI site in the 5'coding region of the SEQ ID NO: 2: SP722 by 
a 1.2 kb fragment containing an E. coli origin of replication. 
5 This fragment was PCR amplified from the pUC19 (GenBank 
Accession #:X02514) using the forward primer: 5'- 
gacctgcagtcaggcaacta-3 ' and the reverse primer: 5'- 
tagagtcgacctgcaggcat-3 ' . The PCR amplicon and the pJEl vector 
were digested with PstI at 37°C for 2 hours. The pJEl vector 
10 fragment and the PCR fragment were ligated at room temperature, 
for 1 hour and transformed in E . coli by electrotransf ormation . 
The resulting vector is designated pDorKlOl. 

Filter screening assays 

15 The assay can be used to screening of Termamyl-like <x- 

amylase variants having an improved stability at high pH 
compared to the parent enzyme and Termamyl-like a-amylase 
variants having an improved stability at high pH and medium 
temperatures compared to the parent enzyme depending of the 

20 screening temperature setting 

High pH filter assay 

Bacillus libraries are plated on a sandwich of cellulose 
acetate (OE 67, Schleicher & Schuell, Dassel, Germany) - and 
25 nitrocellulose filters (Protran-Ba 85, Schleicher & Schuell, 
Dassel, Germany) on TY agar plates with 10 \xg/ml kanamycin at 

37°C for at least 21 hours. The cellulose acetate layer is 
located on the TY agar plate. 

Each filter sandwich is specifically marked with a needle 

30 after plating, but before incubation in order to be able to 
localize positive variants on the filter and the nitrocellulose 
filter with bound variants is transferred to a container with 
glycin-NaOH buffer, pH 8.6-10.6 and incubated at room 
temperature (can be altered from 10°-60°C) for 15 min. The 

35 cellulose acetate filters with colonies are stored on the TY- 
plates at room temperature until use. After incubation, 
residual activity is detected on plates containing 1% agarose, 
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0.2% starch in glycin-NaOH buffer, pH 8.6-10.6. The assay 
plates with nitrocellulose filters are marked the same way as 
the filter sandwich and incubated for 2 hours. at room 
temperature. After removal of the filters the assay plates are 
5 stained with 10% Lugol solution. Starch degrading variants are 
detected as white spots on dark blue background and then 
identified on the storage plates. Positive variants are 
rescreened twice under the same conditions as the first screen. 

10 Low calcium filter assay 

The Bacillus library are plated on a sandwich of 
cellulose acetate (OE 67, Schleicher & Schuell, Dassel, 
Germany) - and nitrocellulose filters (Protran-Ba 85, 
Schleicher & Schuell, Dassel, Germany) on TY agar plates with a 
15 relevant antibiotic, e.g., kanamycin or chloramphenicol, at 37°C 
for at least 21 hours. The cellulose acetate layer is located 
on the TY agar plate. 

Each filter sandwich is specifically marked with a needle 
after plating, but before incubation in order to be able to 
localize positive variants on the filter and the nitrocellulose 
filter with bound variants is transferred to a container with 
carbonate/bicarbonate buffer pH 8.5-10 and with different EDTA 
concentrations (0.001 mM - 100 mM) . The filters are incubated 
at room temperature for 1 hour. The cellulose acetate filters 
with colonies are stored on the TY-plates at room temperature 
until use. After incubation, residual activity is detected on 
plates containing 1% agarose, 0.2% starch in 
carbonate/bicarbonate buffer pH 8.5-10. The assay plates with 
nitrocellulose filters are marked the same way as the filter 
30 sandwich and incubated for 2 hours, at room temperature. After 
removal of the filters the assay plates are stained with 10% 
Lugol solution. Starch degrading variants are detected as white 
spots on dark blue background and then identified on the 
storage plates. Positive variants are rescreened twice under 
35 the same conditions as the first screen. 

Method for obtaining the regions of interest: 

There are three known 3D structures of bacterial oc- 
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amylases. Two of B . licheniformis a-amylase, Brookhaven 
database 1BPL (Machius et al. (1995), J. Mol . Biol. 246, p. 
545-559) and 1VJS (Song et al. (1996), Enzymes for Carbohydrate 
163 Engineering (Prog. Biotechnol . V 12). These two structures 
5 are lacking an important piece of the structure from the so- 
called B-domain, in the area around the two Calcium ions and 
one Sodium ion binding sites. We have therefore used a 3D 
structure of an oc-amylase BA2 (WO 96/23874 which are a hybrid 
between BAN™ (SEQ ID NO. 5) and B. licheniformis a-amylase (SEQ 
10 ID NO. 4). On basis of the structure a model of B. 
licheniformis alpha amylase and the SP722 a-amylase has been 
build . 

Fermentation and purification of a -amylase variants 
15 Fermentation and purification may be performed by methods 

well known in the art. 

Stability determination 

All stability trials are made using, the same set up. The 
20 method are: 

The enzyme is incubated under the relevant conditions (1-4). 
Samples are taken at various time points, e.g., after 0, 5, 10, 
15 and 30 minutes and diluted 25 times (same dilution for all 
taken samples) in assay buffer (0.1M 50mM Britton buffer pH 

25 7.3) and the activity is measured using the Phadebas assay 
(Pharmacia) under standard conditions pH 7.3, 37 °C. 

The activity measured before incubation (0 minutes) is used 
as reference (100%) . The decline in percent is calculated as a 
function of the incubation time. The table shows the residual 

30 activity after, e.g., 30 minutes of incubation. 

Specific activity determination 

The specific activity is determined using the Phadebas 
assay (Pharmacia) as activity/mg enzyme. The manufactures 
35 instructions are followed (see also below under "Assay for a- 
amylase activity) . 
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Assays for ^-Amylase Activity 

1. Phadebas assay 

cc-amylase activity is determined by a method employing 
Phadebas® tablets as substrate. Phadebas tablets (Phadebas® 
Amylase Test, supplied by Pharmacia Diagnostic) contain a cross- 
linked insoluble blue-colored starch polymer which has been 
mixed with bovine serum albumin and a buffer substance and 
tabletted . 

For every single measurement one tablet is suspended in a 
tube containing 5 ml 50 mM Britton-Robinson buffer (50 mM acetic 
acid, 50 mM phosphoric acid, 50 mM boric acid, 0.1 mM CaCl 2 , pH 
adjusted to the value of interest with NaOH) . The test is 
performed in a water bath at the temperature of interest. The cc- 
amylase to be tested is diluted in x ml of 50 mM Britton- 
Robinson buffer. 1 ml of this cc-amylase solution is added to the 
5 ml 50 mM Britton-Robinson buffer. The starch is hydrolyzed by 
the cc-amylase giving soluble blue fragments.' The absorbance of 
the resulting blue solution, measured spectrophotometrically at 
620 nm, is a function of the cc-amylase activity. 

It is important that the measured 620 nm absorbance after 10 
or 15 minutes of incubation (testing time) is in the range of 
0.2 to 2.0 absorbance units at 620 nm. In this absorbance range 
there is linearity between activity and absorbance (Lambert-Beer 
law) . The dilution of the enzyme must therefore be adjusted to 
fit this criterion. Under a specified set of conditions (temp., 
pH, reaction time, buffer conditions) 1 mg of a given oc-amylase 
will hydrolyze a certain amount .of substrate and a blue colour 
will be produced. The colour intensity is measured at 620 nm. 
The measured absorbance is directly proportional to the specific 
activity (activity/mg of pure cc-amylase protein) of the cc- 
amylase in question under the given set of conditions. 

2. Alternative method 

cc-amylase activity is determined by a method employing the 
PNP-G7 substrate. PNP-G7 which is a abbreviation for p- 
35 nitrophenyl-cc,D-maltoheptaoside is a blocked oligosaccharide 
which can be cleaved by an endo-amylase . Following the cleavage, 
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the a-Glucosidase included in the kit digest the substrate to 
liberate a free PNP molecule which has a yellow colour and thus 
can be measured by visible spectophometry at X.=405nm. (400-420 
nm.). Kits containing PNP-G7 substrate and a-Glucosidase is 
5 manufactured by Boehringer-Mannheim (cat. No. 1054635) . 

To prepare the substrate one bottle of substrate (BM 
1442309) is added to 5 ml buffer (BM1442309) . To prepare the a- 
Glucosidase one bottle of a-Glucosidase (BM 1462309) is added to 
45 ml buffer (BM1442309) . The working solution is made by mixing 
10 5 ml a-Glucosidase solution with 0 . 5 ml substrate. 

The assay is performed by transforming 20ul enzyme solution 
to a 96 well microtitre plate and incubating at 25°C. 200 ul 
working solution, 25°C is added. The solution is mixed and pre- 
incubated 1 minute and absorption is measured every 15 sec. over 
15 3 minutes at OD 405 nm. 

The slope of the time dependent absorption-curve is directly 
proportional to the specific activity (activity per mg enzyme) 
of the a-amylase in question under the given set of conditions. 

20 General method for random mutagenesis by use of the DO PE 
program 

The random mutagenesis may be carried out by the following 
steps: 

1. Select regions of interest for modification in the parent 
25 enzyme 

2. Decide on mutation sites and non-mutated sites in the 
selected region 

3. Decide on which kind of mutations should be carried out, 
e.g. with respect to the desired stability and/or performance 

30 of the variant to be constructed 

4. Select structurally reasonable mutations. 

5. Adjust the residues selected by step 3 with regard to step 
4. 

6. Analyze by use of a suitable dope algorithm the nucleotide 
35 distribution. 

7. If necessary, adjust the wanted residues to genetic code 
realism (e.g., taking into account constraints resulting from 
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the genetic code (e.g. in order to avoid introduction of stop 
codons))(the skilled person will be aware that some codon 
combinations cannot be used in practice and will need to be 
adapted) 

8. Make primers 

9. Perform random mutagenesis by use of the primers 

10. Select resulting a-amylase variants by screening for the 
desired improved properties. 

■' Suitable dope algorithms for use in step 6 are well known 
in. the art. One algorithm is described by Tomandl, D. et al., 
Journal of Computer-Aided Molecular Design, 11 (1997), pp . 29- 
38). Another algorithm, DOPE, is described in the following: 

The dope program 

15 The "DOPE" program is a computer algorithm useful to 

optimize the nucleotide composition of a codon triplet in such 
a way that it encodes an amino acid distribution which 
resembles most the wanted amino acid distribution. In order to 
assess which of the possible distributions is the most similar 

20 to the wanted amino acid distribution, a scoring function is 
needed. In the "Dope" program the following function was found 
to be suited: 
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i-y, 



where the x ( . ' s are the obtained amounts of amino acids and 
groups of amino acids as calculated by the program, v, ' s are 
the wanted amounts of amino acids and groups of amino acids as 
defined by the user of the program (e.g. specify which of the 
20 amino acids or stop codons are wanted to be introduced, e.g. 
with a certain percentage (e.g. 90% Ala, 3% He, 7% Val) , and 
w., >s are assigned weight factors as defined by the user of the 
program (e.g., depending on the importance of having a specific 
amino acid residue inserted into the position in question) . N 
is 21 plus the number of amino acid groups as defined by the 
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user of the program. For purposes of this function 0° is 
defined be:: 

A Monte-Carlo algorithm (one example being the one 
described by Valleau, J. P. & Whittington, S.G. (1977) A guide 
to Mont Carlo - for statistical mechanics: 1 Highways- In 
"Stastistical Mechanics, Part A" Equlibrium Techniqeues ed. 
B.J. Berne, New York: Plenum) is used for finding the maximum 
value of this function. In each iteration the following steps 
are performed: 

1. A new random nucleotide composition is chosen for each base, 
where the absolute difference between the current and the new 
composition is smaller than or equal to d for each of the 
four nucleotides G,A,T,C in all three positions of the codon 
(see below for definition of d) . 

2. The scores of the new composition and the current composition 
are compared by the use of the function s as described above. 
If the new score is higher or equal to the score of the 
current composition, the new composition is kept and the 
current composition is changed to the new one. If the new 

20 score is smaller, the probability of keeping the new 

composition is exp(\000(new_score -current _score)) . 

A cycle normally consists of 1000 iterations as described 
above in which d is decreasing linearly from 1 to 0 . One 
hundred or more cycles are performed in an optimization 
25 process. The nucleotide composition resulting in the highest 
score is finally presented. 

EXAMPLES 



15 



30 EXAMPLE 1 

Example on Homology building of Termamyl™ 

The overall homology of the B. licheniformis ot-amylase (in 
the following referred to as Termamyl™) to other Termamyl-like 
cx-amylases is high and the percent similarity is extremely high. 
35 The similarity calculated for Termamyl™ to BSG (the B. 
stearothermophilus a-amylase having SEQ ID NO: 3), and BAN (the 
B. amyloliquefaciens a-amylase having SEQ ID NO: 5) using the 
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University of Wisconsin Genetics Computer Group's program GCG 
gave 89% and 78%, respectively. TERM has a deletion of 2 
residues between residue G180 and K181 compared to BAN™ and BSG. 
BSG has a deletion of 3 residues between G371 and 1372 in 
comparison with BAN™ and Termamyl™. Further BSG has a C-terminal 
extension of more than 20 residues compared to BAN™ and 
Termamyl™. BAN™ has 2 residues less and Termamyl has one' residue 
less in the N-terminal compared to BSG. 

The structure of the B. licheniformis (Termamyl™) and of 
the B. amyloliquefaciens a-amylase (BAN™) , respectively, was 
model built on the structure disclosed in Appendix 1 of WO 
96/23974. The structure of other Termamyl-like oc-amylases (e.g. 
those disclosed herein) may be built analogously. 

In comparison with the a-amylase used for elucidating the 
15 present structure, Termamyl™ differs in that it lacks two 
residues around 178-182. In order to compensate for this in the 
model structure, the HOMOLOGY program from BIOSYM was used to 
substitute the residues in equivalent positions in the structure 
(not only structurally conserved regions.) except for the dele- 
20 tion point. A peptide bond was established between G179(G177) 
and K180(K180) in Termamyl™ (BAN™) . The close structural 
relationship between the solved structure and the model 
structure (and thus the validity of the latter) is indicated by 
the presence of only very few atoms found to be too close 
25 together in the model. 

To this very rough structure of Termamyl™ was then added 
all waters (605) and ions (4 Calcium and 1 Sodium) from the 
solved structure (See Appendix 1 of WO 96/23874) at the same 
coordinates as for said solved structure using the INSIGHT 
program. This could be done with only few overlaps - in other 
words with a very nice fit. This model structure were then 
minimized using 200 steps of Steepest descent and 600 steps of 
Conjugated gradient (see Brooks et al 1983, J. Computational 
Chemistry 4, p. 187-217). The minimized structure was then sub- 
jected to molecular dynamics, 5ps heating followed by up to 
200ps equilibration but more than 35ps. The dynamics as run with 
the Verlet algorithm and the equilibration temperature 300K were 
kept using the Behrendsen coupling to a water bath (Berendsen 
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et. al., 1984, J. Chemical Physics 81, p. 3684-3690). Rotations 

and translations were removed every pico second. 

EXAMPLE 2 

5 Method of extracting important regions for identifying g- 
amylase variants with improved pH stability and altered 
temperature activity 

The X-ray structure and/or the model build structure of 
the enzyme of interest, here SP722 and Termamyl™, are subjected 

10 to molecular dynamics simulations. The molecular dynamics 
simulation are made using the CHARMM (from Molecular 
simulations (MSI)) program or other suited program like, e.g., 
DISCOVER (from MSI) . The molecular dynamic -analysis is made in 
vacuum, or more preferred including crystal waters, or with the 

15 enzyme embedded in water, e.g., a water sphere or a water box. 
The simulation are run for 300 pico seconds (ps) or more, e.g., 
300-1200 ps. The isotropic fluctuations are extracted for the 
CA carbons of the structures and compared between the 
structures. Where the sequence has deletions and/or insertions 

20 the isotropic fluctuations from the other structure are 
inserted thus giving 0 as difference in isotropic fluctuation. 
For explanation of isotropic fluctuations see the CHARMM manual 
(obtainable from MSI) . 

The molecular dynamics simulation can be done using 

25 standard charges on the chargeable amino acids. This is Asp and 
Glu are negatively charged and Lys and Arg are positively 
charged. This condition resembles the medium pH of 
approximately 7. To analyze a higher or lower pH, titration of 
the molecule can be done to obtain the altered pKa's of the 

30 standard titrateable residues normally within pH 2-10; Lys, 
Arg, Asp, Glu, Tyr and His. Also Ser, Thr and Cys are 
titrateable but are not taking into account here. Here the 
altered charges due to the pH has been described as both Asp 
and Glu are negative at high pH, and both Arg and Lys are 

35 uncharged. This imitates a pH around 10 to 11 where the 
titration of Lys and Arg starts, as the normal pKa of these 
residues are around 9-11. 
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1. The approach used for extracting important regions for 
identifying a-amylase variants with high pH stability: 

The important regions for constructing variants with 
improved pH stability are the regions which at the extreme pH 
5 display the highest mobility, i.e., regions having the highest 
isotropic fluctuations . 

Such regions are identified by carrying out two molecular 
dynamics simulations: i) a high pH run at which the basic 
amino acids, Lys and Arg, are seen as neutral (i.e. not 
10 protonated) and the acidic amino acids, Asp and Glu, have the 
charge (-1) and ii) a neutral pH run with the basic amino 
acids, Lys and Arg, having the net .charge of (+1) and the 
acidic amino acids having a charge of (-1) . 

The two run are compared and regions displaying the 
15 relatively higher mobility at high pH compared to neutral pH 
analysis were identified. 

Introduction of residues improving general stability, e.g., 
hydrogen bonding, making the region more rigid (by mutations 
such-- as Proline substitutions or replacement of Glycine 
20 residues) , or improving the charges or their interaction, 
improves the high pH stability of the enzyme. 

2. The approach used for extracting regions for identifying 
a-amylase variants with increased activity at medium 
temperatures : 

25 . The important regions for constructing variants with 

increased activity at medium temperature was found as the 
difference between the isotropic fluctuations in SP722. and 
Termamyl, i.e., SP722 minus Termamyl CA isotrophic 
fluctuations, The regions with the highest mobility in the 

30 isotrophic fluctuations were selected. These regions and there 
residues were expected to increase the activity at medium 
temperatures. The activity of an alpha-amylase is only 
expressed if the correct mobility of certain residues are 
present. If the mobility of the residues is too low the 

35 activity is decreased or abandoned. 
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EXAMPLE 3 

Construction, by localized random, doped mutagenesis/ of 
Termamyl-like ^-amylase variants having an improved Ca2+ 
stability at medium temperatures compared to the parent enzyme 
5 To improve the stability at low calcium concentration of a- 

amylases random mutagenesis in pre-selected region was 
performed . 

Region: Residue: 
SAI: R181-W189 

10 The DOPE software (see Materials and Methods) was used to 

determine spiked codons for each suggested change in the SAI 
region minimizing the amount of stop codons (see table 1). The 
exact distribution of nucleotides was calculated in the three 
positions of the codon to give the suggested population of 

15 amino acid changes. The doped regions were doped specifically 
in the indicated positions to have a high chance of getting the 
desired residues, but still allow other possibilities. 

Table 1: 

20 Distribution of amino acid residues for each position 
R181: 72% R, 2% N, 7% Q, 4% H, 4%K, 11%S 
G182: 73% G, 13% A, 12% S, 2% T 
K185: 95% K, 5% R 

A186: 50% A, 4% N, 6% D, 1%E, 1% G, 1% K, 5% S, 31% T 
25 W187: 100% W 
D188: 100% D 
W189: 92% W, 8% S 

The resulting doped oligonucleotide strand is shown in 
30 table 2 as sense strand: with the wild type nucleotide and 
amino acid sequences and the distribution of nucleotides for 
each doped position. 



Table 2: 
35 Position 

Amino acid seq. 
Wt nuc. seq. 



181 182 185 186 187 188 189 
Arg Gly Lys Ala Thr Asp Thr 
cga ggt aaa get tgg gat tgg 



40 Forward primer (SEQ ID NO: 15): 

FSA: 5'-caa aat cgt ate tac aaa ttc 123 456 a7g 8910 tgg 

gat tllg gaa gta gat teg gaa aat-3' 

Distribution of nucleotides for each doped Position 
45 1: 35% A, 65% C 
2: 83% G, 17% A 



1NSDOCID: <WO 992321 1 At _1_> 



WO 99/23211 



PCT/DK98/00471 - 



41 

3: 63% G, 37% T 
4: 86% G, 14% A 
5: 85% G, 15% C 
6: 50% T, 50% C 
5 7: 95% A, 5%G 

8: 58% G, 37% A, 5% T 
9: 86% C, 13% A, 1% G 
10: 83% T, 17% G 
11: 92% G, 8% C 

10 

Reverse primer (SEQ ID NO: 16) : 

RSA: 5'-gaa ttt gta gat acg att ttg-3' 

Random mutagenesis 

15 The spiked oligonucleotides apparent from Table 2 (which by 

a common term is designated FSA) and reverse primers RSA for 
the SA1 region and specific SEQ ID NO: 2: SP722 primers 
covering the SacII and the Drain sites are used to generate 
PCR-library-fragments by the overlap extension method (Horton 

20 et al., Gene, 77 (1989), pp. 61-68) with an overlap of 21 base 
pairs. Plasmid pJEl is template for the Polymerase Chain 
Reaction. The PCR fragments are cloned in the E. coll/ Bacillus 
shuttle vector pDorklOl (see Materials and Methods) enabling 
mutagenesis in E. coll and immediate expression in Bacillus 

25 subtilis preventing lethal accumulation of amylases in E. coli. 
After establishing the cloned PCR fragments in E. coli, a 
modified pUC19 fragment is digested out of the plasmid and the 
promoter and the mutated Termamyl gene is physically connected 
and expression can take place in Bacillus. 

30 

Screening 

The library may be screened in the low calcium filter 
assays described in the "Material and Methods" section above. 

35 EXAMPLE 4 

Construction of variants of amylase SEQ ID NO: 1 (SP69 0) 

The gene encoding the amylase from SEQ ID NO: 1 is located 

in a plasmid pTVB106 described in W096/23873. The amylase is 

expressed from the amyL promoter in this construct in Bacillus 
40 sujbtilis. 

A variant of the protein is delta (T183-G184 ) 

+Y243F+Q391E+K444Q. Construction of this variant is described 
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in W096/23873. 

Construction of delta (T183-G184 ) + N195F by the mega-primer 
method as described by Sarkar and Sommer, (1990), BioTechniques 
8: 404-407. 

5 Gene specific primer Bl (SEQ ID NO: 17) and mutagenic 

primer 101458 (SEQ ID NO: 19) were used to amplify by PCR an 
approximately 645 bp DNA fragment from a pTVB106-like plasmid 
(with the delta (T183-G184 ) mutations in the gene encoding the 
amylase from SEQ ID NO: 1) . 

10 The 645 bp fragment was purified from an agarose gel and 

used as a mega-primer together with primer Y2 (SEQ ID NO: 18) 
in a second PCR carried out on the same template. 

The resulting approximately 1080 bp fragment was digested 
with restriction enzymes BstEII and Afllll and the resulting 

15 approximately 510 bp DNA fragment was purified and ligated with 
the pTVB106-like plasmid (with the delta (T183-G184 ) mutations 
in the gene encoding the amylase from SEQ ID NO: 1) digested 
with the same enzymes. Competent Bacillus subtilis SHA273 
(amylase and protease low) cells were transformed with the 

20 ligation and Chlorampenicol resistant transf ormants and was 
checked by DNA sequencing to verify the presence of the correct 
mutations on the plasmid* 
primer Bl: (SEQ ID NO: 17) 
5' CGA TTG CTG ACG CTG TTA TTT GCG 3' 

25 primer Y2: (SEQ ID NO: 18) 

5' CTT GTT CCC TTG TCA GAA CCA ATG 3' 
primer 101458 (SEQ ID NO: 19): 

5' GT CAT AGT TGC CGA AAT CTG TAT CGA CTT C 3' 

The construction of variant: delta (T183-G184 ) + K185R+A186T 
30 was carried out in a similar way except that mutagenic primer 
101638 was used . 
primer 101638: (SEQ ID NO: 20) 

5' CC CAG TCC CAC GTA CGT CCC CTG AAT TTA TAT ATT TTG 3' 

Variants: delta (T18 3-G184) +A18 6T, delta (T183-G184 ) +A18 6I, 
35 delta(T183-G184) +A186S, delta (T183-G184 ) +A186N are 
constructed by a similar method except that pTVB106-like 
plasmid (carrying variant delta (T18 3-G184 ) + K185R+A186T) is 
used as template and as the vector for the cloning purpose. The 
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mutagenic oligonucleotide (Oligo 1) is: 

5' CC CAG TCC CAG NTCTTT CCC CTG AAT TTA TAT ATT TTG 3' (SEQ ID 
NO: 21) 

N represents a mixture of the four bases: A, C, G, and T 
5 used in the synthesis of the mutagenicoli-gonucleotide . 

Sequencing of transf ormants identifies the correct codon for 
amino acid position 186 in the mature amylase. 

Variant: delta (T18 3-G18 4) + K18 5R+A18 6T+N195F is 

constructed as follows: 

10 PCR is carried out with primer x2 (SEQ ID NO: 22) and primer 
101458 (SEQ ID NO: 19) on pTVB106-like plasmid (with mutations 
delta(T183-G184) + K185R+A18 6T ) . The resulting DNA fragment is 
used as a mega-primer together with primer Y2 (SEQ ID NO: 18) 
in a PCR on pTVB106-like plasmid (with mutations delta (T183- 

15 G184) + N195) . The product of the second PCR is digested with 
restriction endonucleases Acc65I and Afllll and cloned into 
pTVB106 like plasmid (delta (T183-G184 ) +N1 95F) digested with the 
same enzymes. 

primer x2 : (SEQ ID NO: 22) 

20 5'~GCG TGG ACA AAG TTT GAT TTT CCT G 3' 

Variant: delta (T183-G184) + K185R+A186T+N195F+Y243F+ 
Q391E+K444Q is constructed as follows: 

• PCR is carried out with primer x2 and primer 101458 on 
pTVB106-like plasmid (with mutations delta (T183-G18 4 ) + 

25 . K185R+A186T) . The resulting DNA fragment is used as a mega- 
primer together with primer Y2 in a PCR on pTVB106 like plasmid 
(with mutations delta (T183-G184 ) +Y243F+Q391E+K444Q) . The 
product of the second PCR is digested with restriction 
endonucleases Acc65I and Afllll and cloned into pTVB106 like 

30 plasmid (delta (T183-G184 ) +Y243F+Q391E+K4 44Q) digested with the 
same enzymes. 

Example 5 

Construction of site-directed g-amylase variants in the parent 
35 SP722 g-amylase (SEQ ID NO: 2) 

Construction of variants of amylase SEQ ID NO: 2 (SP722) is 
carried out as described below. 

The gene encoding the amylase from SEQ ID NO: 2 is located 
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in a plasmid pTVB112 described in WO 96/23873. The amylase is 
expressed from the amyL promoter in this construct in Bacillus 
subtilis . 

Construction of delta ( D183-G184 ) + V56I by the mega-primer 
5 method as described by Sarkar and Sommer, 1990 (BioTechniques 
8: 404-407) . 

Gene specific primer DA03 and mutagenic primer DA07 are 
used to amplify by PCR an approximately 820 bp DNA fragment 
from a pTVB112-like plasmid (with the delta ( D183-G184 ) 

10 mutations in the gene encoding the cc-amylase shown in SEQ ID 
NO: 2. 

The 820 bp fragment is purified from an agarose gel and 
used as a mega-primer together with primer DA01 in a second PCR 
carried out on the same template. 

15 The resulting approximately 920 bp fragment is digested 

with restriction enzymes NgoM I and Aat II and the resulting 
approximately 170 bp DNA fragment is purified and ligated with 
the pTVB112-like plasmid (with the delta ( D183-G184 ) mutations 
in the gene encoding the amylase shown in SEQ ID NO: 2) 

20 digested with the same enzymes. Competent Bacillus subtilis 
SHA273 (amylase and protease low) cells are transformed with 
the ligation and Chlorampenicol resistant transf ormants are 
checked by DNA sequencing to verify the presence of the correct 
mutations on the plasmid. 

25 primer DA01: (SEQ ID NO: 23) 
5' CCTAATGATGGGAATCACTGG 3' 
primer DA03: (SEQ ID NO:24) 
5' GCATTGGATGCTTTTGAACAACCG 3' 
primer DA07 (SEQ ID NO: 25): 

30 5' CGCAAAATGATATCGGGTATGGAGCC 3' 

Variants: delta ( D183-G184 ) + K108L, delta (D183-G184 ) + K108Q, 
delta(D183-G184) + K108E, delta (D183-G184) + K108V, were 
constructed by the mega-primer method as described by Sarkar 
and Sommer ,1990 (BioTechniques 8: 404-407): 

35 PCR is carried out with primer DA03 and mutagenesis primer 

DA20 on pTVB112-like plasmid (with mutations delta ( D183-G184 )) . 
The resulting DNA fragment is used as a mega-primer together 
with primer DA01 in a PCR on pTVB112-like plasmid (with 
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mutations delta (D183-G184 )) . The approximately 920 bp product 
of the second PCR is digested with restriction endonucleases 
Aat II and Mlu I and cloned into pTVB112-like plasmid 
(delta (D183-G184) ) digested with the same enzymes. 
5 primer DA20 (SQ ID NO: 26): 

5' GTGATGAACCACSWAGGTGGAGCTGATGC 3' 

S represents a mixture of the two bases: C and G used in 
the synthesis of the mutagenic oligonucleotide and W represents 
a mixture of the two bases: A and T used in the synthesis of 
10 the mutagenic oligonucleotide. 

Sequencing of transf ormants identifies the correct codon 
for amino acid position 108 in the mature amylase. 

Construction of the variants: delta (D183-G184 ) + D168A, 
delta (D183-G184) + D168I, delta ( D183-G184 ) + D168V, delta(D183- 
15 G184) + D168T is carried out in a similar way except that 
mutagenic primer DA14 is used, 
primer DA14 (SEQ ID NO:27): 
5' GATGGTGTATGGRYCAATCACGACAATTCC 3' 

R represents a mixture of the two bases: A and G used in 
the synthesis of the mutagenic oligonucleotide and Y represents 
a mixture of the two bases: C and T used in the synthesis of 
the mutagenic oligonucleotide. 

Sequencing of transf ormants identifies the correct codon 
for amino acid position 168 in the mature amylase. 

Construction of the variant: delta ( D183-G184 ) + Q169N is 
carried out in a similar way except that mutagenic primer DA15 
is used. 

primer DA15 (SEQ ID NO:28): 
5' GGTGTATGGGATAACTCACGACAATTCC 3' 

Construction of the variant: delta ( D183-G184 ) + Q169L is 
carried out in a similar way except that mutagenic primer DA16 
is used. 

primer DA16 (SEQ ID NO:29): 
5' GGTGTATGGGATCTCTCACGACAATTCC 3' 

Construction of the variant: delta ( D183-G184 ) + Q172N is 
carried out in a similar way except that mutagenic primer DA17 
is used. 

primer DA17 (SEQ ID NO:30): 



20 



25 



30 



35 
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5' GGGATCAATCACGAAATTTCCAAAATCGTATC 3' 

Construction of the variant: delta ( D183-G184 ) + Q172L is 
carried out in a similar way except that mutagenic primer DA18 
is used. 
5 primer DA18 (SEQ ID N0:31): 

5' GGGATCAATCACGACTCTTCCAAAATCGTATC 3' 

Construction of the variant: del ta ( D183-G184 ) + L201I is 
carried out in a similar way except that mutagenic primer DA06 
is used. 

10 primer DA06 (SEQ ID NO:32): 

5' GGAAATTATGATTATATCATGTATGCAGATGTAG 3' 

Construction of the variant: delta ( D183-G184 ) + K269S is 
carried out in a similar way except that mutagenic primer DA09 
is used. 

15 primer DA09 (SEQ ID NO: 33): 

5' GCTGAATTTTGGTCGAATGATTTAGGTGCC 3' 

Construction of the variant: delta ( D183-G184 ) . + K269Q is 
carried out in a similar way except that mutagenic primer DA11 
is used. 

20 primer DA11 (SEQ ID NO:34): 

5' GCTGAATTTTGGTCGAATGATTTAGGTGCC 3' 

Construction of the variant: delta ( D183-G184 ) + N270Y is 
carried out in a similar way except that mutagenic primer DA21 
is used, 

25 primer DA21 (SEQ ID NO:35): 

5' GAATTTTGGAAGTACGATTTAGGTCGG 3' 

Construction of the variants: delta (D183-G184 ) + L272A, 

delta (D183-G184) + L272I, delta (D183-G184 ) + L272V, delta(D183- 

G184) + L272T is carried out in a similar way except that 
30 mutagenic primer DA12 is used. 

primer DA12 (SEQ ID NO:36): 

5' GGAAAAACGATRYCGGTGCCTTGGAGAAC 3' 

R represents a mixture of the two bases: A and G used in the 
synthesis of the mutagenic oligonucleotide and Y represents a 
35 mixture of the two bases: C and T used in the synthesis of the 
mutagenic oligonucleotide . 

Sequencing of transf ormant s identifies the correct codon for 
amino acid position 272 in the mature amylase. 
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Construction of the variants: delta ( D183-G184 ) + L275A, 
delta(D183-G184) + L275I, delta (D183-G184 ) + L275V, delta(D183- 
G184) + L275T is carried out in a similar way except that 
mutagenic primer DA13 is used. 
5 primer DA13 (SEQ ID NO: 37): 

5' 1GATTTAGGTGCCTRYCAGAACTATTTA 3' 

R represents a mixture of the two bases: A and G used in the 

synthesis of the mutagenic oligonucleotide and Y represents a 

mixture of the two bases: C and T used in the synthesis of the 
10 mutagenic oligonucleotide. 

Sequencing of transf ormants identifies the correct codon for 

amino acid position 275 in the mature amylase. 

Construction of the variant: delta ( D183-G184 ) + Y295E is 

carried out in a similar way except that mutagenic primer DA08 
15 is used. 

primer DA08 (SEQ ID NO:38): 

5' CCCCCTTCATGAGAATCTTTATAACG 3' 

Construction of delta (D183-G184 ) + K446Q by the mega-primer 

method as described by Sarkar and Sommer, 1990 (BioTechniques 8: 
20 404-407) : 

Gene specific primer DA04, annealing 214-231 bp downstream 
relative to the STOP-codon and mutagenic primer DA10 were used 
to amplify by PCR an approximately 350 bp DNA fragment from a 
pTVB112-like plasmid (with the delta ( D183-G184 ) mutations in 

25 the gene encoding the amylase depicted in SEQ ID NO: 2) . 

The resulting DNA fragment is used as a mega-primer 
together with primer DA05 in a PCR on pTVB112 like plasmid 
(with mutations delta (D183-G184) ) . The app. 460 bp product of 
the second PCR is digested with restriction endonucleases SnaB 

30 I and Not I and cloned into pTVB112 like plasmid (delta (D183- 
G184)) digested with the same enzymes, 
primer DA04 (SEQ ID NO:39): 
5' GAATCCGAACCTCATTACACATTCG 3' 
primer DA05 (SEQ ID NO:40): 

35 5' CGGATGGACTCGAGAAGGAAATACCACG 3' 
primer DA10 (SEQ ID NO:41): 
5' CGTAGGGCAAAATCAGGCCGGTCAAGTTTGG 3' 

Construction of the variants: delta (D183-G184 ) + K458R is 
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carried out in a similar way except that mutagenic primer DA22 
is used. 

primer DA22 (SEQ ID NO:42): 
5' CATAACTGGAAATCGCCCGGGAACAGTTACG 3' 
5 Construction of the variants: delta ( D183-G184 ) + P459S and 

delta (D183-G184) + P459T is carried out in a similar way except 
that mutagenic primer DA19 is used, 
primer DA19 (SEQ ID NO:43): 
5' CTGGAAATAAAWCCGGAACAGTTACG 3' 
10 W represents a mixture of the two bases: A and T used in the 
synthesis of the mutagenic oligonucleotide. 

Sequencing of transf ormants identifies the correct codon for 
amino acid position 459 in the mature amylase. 

Construction of the variants: delta ( D183-G18 4 ) + T461P is 
15 carried out in a similar way except that mutagenic primer DA23 
is used. 

primer DA23 (SEQ ID NO:44): 

5' GGAAATAAACCAGGACCCGTTACGATCAATGC 3' 

Construction of the variant: delta (D183-G184 ) + K142R is 
20 carried out in a similar way except that mutagenic primer DA32 
is used. 

Primer DA32 (SEQ ID NO: 45) : 

5' GAGGCTTGGACTAGGTTTGATTTTCCAG 3' 

Construction of the variant: delta (D183-G184 ) + K269R is 
25 carried out in a similar way except that mutagenic primer DA31 
is used. 

Primer DA31 (SEQ ID NO: 46) : 

5' GCTGAATTTTGGCGCAATGATTTAGGTGCC 3' 

30 Example 6 

Construction of site-directed ^-amylase variants in the parent 
Termamyl g-amylase (SEQ ID NO: 4) 

The amyL gene, encoding the Termamyl a-amylase is located 
in plasmid pDN1528 described in WO 95/10603 (Novo Nordisk) . 
35 Variants with substitutions N265R and N265D, respectively, of 
said parent a-amylase are constructed by methods described in 
WO 97/41213 or by the "megaprimer" approach described above. 
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Mutagenic oligonucleotides are: 
Primer bll for the N265R substitution: 

5' PCC AGC GCG CCT AGG TCA CGC TGC CAA TAT TCA G (SEQ ID NO: 
5 56) 

Primer bl2 for the N265D substitution: 

5' - PCC AGC GCG CCT AGG TCA TCC TGC CAA TAT TCA G (SEQ ID NO: 
57) 

P represents a phosphate group. 

10 

Example 7 

Determination of pH stability at alkaline pH of variants of the 
parent a -Amylase having the amino acid sequence shown in SEQ ID 
NO: 2, 

15 In this serie of analysis purified enzyme samples were used. 
The measurements were made using solutions of the respective 
variants in 100 mM CAPS buffer adjusted to pH 10.5. The 
solutions were incubated at 75°C. 

After incubation for 20 and 30 min the residual activity was 

20 measured using the PNP-G7 assay (described in the "Materials 
and Methods" section above) . The residual activity in the 
samples was measured using Britton Robinson buffer pH 7*3. The 
decline in residual activity was measured relative to a 
corresponding reference solution of the same enzyme at 0 

25 minutes, which has not been incubated at high pH and 75°C. 

The percentage of the initial activity as a function is shown 
in the table below for the parent enzyme (SEQ ID NO: 2) and for 
the variants in question. 



variant 


Kesiduax activity 
after 20 rain 


Kesiauai activity 
after 30 min 


A(D183-G184)+M323L 


5b % 


44 % 


A(D183-G184) +M323L+R181S 




3b % 


A(D183-G184) +M323L+A186T 


bZ % 


bU % 



30 In an other series of analysis culture supernatants were 
used. The measurements were made using solutions of the 
respective variants in 100 mM CAPS buffer adjusted to pH 10.5. 
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The solutions were incubated at 80°C. 

After incubation for 30 minutes the residual activity was 
measured using the Phadebas assay (described in the "Materials 
and Method" secion above. The residual activity in the samples 
5 was measured using Britton Robinson buffer pH 7.3. The decline 
in residual activity was measured relative to a corresponding 
reference solution of the same enzyme at 0 minutes, which has 
not been incubated at high pH and 80°C. 

The percentage of the initial activity as a function is shown 
10 in the table below for the parent enzyme (SEQ ID NO: 2) and for 
the variants in question. 



variant 


Kesiauai activity atter 3U min 


A(D183-G184) 


q % 


A(D183-G184) +P459T 




A(D183-G184 ) +K458R 


Jl % 


A(D183-G184) +K311R 


1U * 



Example 8 

Determination of calcium stability at alkaline pH of variants 
15 of the parent g-Amylase having the amino acid sequence shown in 
SEQ ID NO: 1, SEQ ID NO: 2 and SEQ ID NO: 4. 

A: Calcium stability of variants of the sequence in SEQ ID N0:1 
The measurement were made using solutions of the respective 

20 variants in 100 mM CAPS buffer adjusted to pH 10.5 to which 
polyphosphate was added (at time. t=0) to give a "final 
concentration of 2400 ppm. The solutions were incubated at 50°C. 

After incubation for 20 and 30 minutes the residual activity 
was measured using the PNP-G7 assay (described above) . The 

25 residual activity in the samples was measured using Britton 
Robinson buffer pH 7.3. The decline in residual activity was 
measured relative to a corresponding reference solution of the 
same enzyme at 0 minutes, which has not been incubated at high 
pH and 50°C. 

30 The percentage of the initial activity as a function is shown 
in the table below for the parent enzyme (SEQ ID NO: 1) and for 
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the variants in question. 



Variant 


Kesiclual activi f v 
after 20 min 


KPSlflllfl 1 3ft"l V1TV 

l\t-<J X UQ X CI V — -1. V J. u. y 

after 30 min 


A(T183-G184) 


^ % 


iy % 


A(T183-G184) + A186T 




Zd> % 


A(T183-G184) +K185R+A186T 




^y % 


A(T183-G184) +A186I 


Jb % 




A (T183-G184 ) +N195F 


4 4 u « 


n . d . 



n . d . = Not determmatea 



B: Calcium stability of variants of the sequence in SEQ ID NO:2 
5 In this series of analysis purified samples of enzymes were 

used. The measurement were made using solutions of the 
respective variants in 100 mM CAPS buffer adjusted to pH 10.5 
to which polyphosphate was added (at time t = 0) to give a final 
concentration of 2400 ppm. The solutions were incubated at 50°C. 
10 After incubation for 20 and 30 minutes the residual activity 
was measured using the PNP-G7 assay (described above) . The 
residual activity in the samples was measured using Britton 
Robinson buffer pH 7.3. The decline in residual activity was 
measured relative to a corresponding reference solution of the 
15 same enzyme at 0 minutes, which has not been incubated at high 
pH and 50°C. 

The percentage of the initial activity as a function is shown 
in the table below for the parent enzyme (SEQ ID NO: 2) and for 
the variants in question. 



20 



variant 


Kesiduai activity 
after 20 min 


Kesiauai activity 
after 30 min 


A(D183-G184) +M323L 


zl % 


13 * 


A(D183-G18 4) +M323L+R181S 


. SI % 


iy % 


A(D183-G184) +M323L+A18 6T 


Z\i % 


l / % 


A (D18 3-G184) +M323L+A18 6R 


3U % 


id % 




variant 


Kesiauai activity 
after 20 min 


Kesiauai activity 
after 30 min 
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A(D183-G184 ) 




2U% 


A(D183-G184) +N195F 


55% 


44% 



In this serie of analysis culture supernatants were used. The 
measurement were made using solutions of the respective 
variants in 100 mM CAPS buffer adjusted to pH 10.5 to which 
5 polyphosphate was added (at time t=0) to give a final 
concentration of 2400 ppm. The solutions were incubated at 50°C. 

After incubation for 30 minutes the residual activity was 
measured using the Phadebas assay as described above. The 
residual activity in the samples was measured using Britton 
10 Robinson buffer pH 7.3. The decline in residual activity was 
measured relative to a corresponding reference solution of the 
same enzyme at 0 minutes, which has not been incubated at high 
pH and 50°C. 

The percentage of the initial activity as a function is shown 
15 in the table below for the parent enzyme (SEQ ID NO: 2) and for 
the variants in question. 



variant 


Kesiauai activity atter 3U min 


A(D183-G184) 


u « 


A(D183-G184) +P459T 


iy % 


A(D183-G184)+K458R 


±0 % 


A(D183-G184)+T461P 


u % — - 


A(D18 3-G184)+E34 6Q+K38 5R 


4 % 



C: Calcium stability of variants of the sequence in SEQ ID NO: 4 
20 The measurement were made using solutions of the respective 
variants in 100 mM CAPS buffer adjusted to pH 10.5 to which 
polyphosphate was added (at time t=0) to give a final 
concentration of 2400 ppm. The solutions were incubated at 60°C 
for 20 minutes. 

25 After incubation for 20 minutes the residual activity was 
measured using the PNP-G7 assay (described above) . The residual 
activity in the samples was measured using Britton Robinson 
buffer pH 7.3. The decline in residual activity was measured 
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relative to a corresponding reference solution of the same 
enzyme at 0. minutes, which has not been incubated at high pH 
and 60°C. 

The percentage of the initial activity as a function is shown 
5 in the table below for the parent enzyme (SEQ ID NO: 4) and for 
the variants in question. 



variant 


Kesictuai 
activity after 
2 0 min 


Termamyl (StX) ID NO: 4) 


1 / 15 


N^bbR 









Example 9 : 

10 Activity measurement at medium temperature of a-Amylases having 
the amino acid sequence shown in SEQ ID NO: 1. 
A: a -Amylase activity of variants of the sequence in SEQ ID 
NO: 1 

The measurement were made using solutions of the respective 
15 variants in 50 mM Britton Robinson buffer adjusted to pH 7.3 
and using the Phadebas assay described above. The activity in 
the samples was measured at 37°C using 50 mM Britton Robinson 
buffer pH 7,3 and at 25°C using 50 mM CAPS buffer pH 10.5. 

The temperature dependent activity and the percentage of the 
20 activity at 25°C relative to the activity at 37°C are shown in 
the table below for the parent enzyme (SEQ ID NO: 1) and for 
the variants in question. 



variant 


NU/mg 2 5°C 


NU/mg 37°C 


NU (25°C) 
/ NU(37°C) 




1440 


JSUUU 


4.1% 


A(T183-G184) 


^yuu 


1UUUU 


1.6 \ 


A(T183-G184) +K269S 


lHbU 




lb.b % 


A(Q174) 




jyuuu 


/ . y % 



Another measurement was made using solutions of the 
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respective variants in 50 mM Britton Robinso suffer adjusted 

to pH 7.3 and using the Phadebas assay des< bed above. The 

activity in the samples was measured at 37°C and 50°C using 50 
mM Britton Robinson buffer pH 7.3. 
5 The temperature dependent activity and the percentage of the 
activity at 37°C relative to the activity at 50°C is shown in 
the table below for the parent enzyme (SEQ ID NO: 1) and for 
the variants in question. 



variant 


NU/mg 37°C 


NU/mg 50°C 


NU(37°C) / 
NU (50°C) 


Sfbyu (seq ID NO: 1) 


i juyu 




bU % 


K2b9Q 


/8U4 


lUUbJ 


/« % 



B: g-Amylase activity of variants of the sequence in SEQ ID 
NO: 2 

The measurement were made using solutions of the respective 
variants in 50 mM Britton Robinson buffer adjusted to pH 7 . 3 
15 and using the Phadebas assay described above. The activity in 
the samples was measured at both 25°C and 37°C using 50 mM 
Britton Robinson buffer pH 7.3. 

The temperature dependent activity and the percentage of the 
activity at 25°C relative to the activity at 37°C is shown in 
20 the table below for the parent enzyme (SEQ ID NO: 2) and for 
the variants in question. 



variant 


NU/mg 
25°C 


NU/mg 
37 6 C 


NU(25°C) / 
NU(37°C) 


A(D183-G184) +M323L 


3U49 


iOZUA 


JU % 


A(D183-G184) +M323L+R181S 


18b9b 


3b43b 





C: ^-Amylase activity of variants of the sequence in SEQ ID 
NO : 4 

25 The measurement were made using solutions of the respective 
variants in 50 mM Britton Robinson buffer adjusted to pH 7.3 
and using the Phadebas assay described above. The activity in 
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the samples was measured at both 37°C using 50 mM Britton 

Robinson buffer pH 7.3 and at 60°C using 50 mM CAPS buffer pH 
10. 5. 

The temperature dependent activity and the percentage of the 
activity at 37°C relative to the activity at 60°C is shown in 
the table below for the parent enzyme (SEQ ID NO: 4) and for 
the variants in question. 



variant 


NU/mg 37°C 


NU/mg 60°C 


NU(37°C) / NU(60°C) 


'i'ermamyi 


MUU 




1 /u % 


Q2b4S 


1UUUU 


4bbU 


% 
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15 



Example 10 

Construction of variants of parent hybrid BAN : 1 - 
300/Termamyl : 301-483 ^ -amylase 

Plasmid pTVB191 contains the gene encoding hybrid a-arnylase 
BAN: I- 300/Termamyl : 301-4 8 3 as well as an origin of replication 
functional in Bacillus subtilis and the cat gene conferring 
chloramphenicol resistance . 

Variant BM4 (F290E) was constructed using the megaprimer 
approach (Sarkar and Sommer, 1990) with plasmid pTVB191 as 
template . 

Primer pi (SEQ ID NO: 52) and mutagenic oligonucleotide bm4 
(SEQ ID NO: 47) were used to amplify a 444 bp fragment with 
polymerase chain reaction (PCR) under standard conditions. 
This fragment was purified from an agarose gel and used as 
'Megaprimer' in a second PCR with primer p2 (SEQ ID NO: 53) 
resulting in a 531 bp fragment. This fragment was digested with 
restriction endonucleases HinDIII and Tthllll. The 389 bp 
fragment produced by this was ligated into plasmid pTVB191 that 
20 had been cleaved with the same two enzymes. The resulting 
plasmid was transformed into B . subtilis SHA273. Chloramphenicol 
resistant clones were selected by growing the transf ormants on 
plates containing chloramphenicol as well as insoluble starch. 
Clones expressing an active a-amylase were isolated by selecting 
25 clones that formed halos after staining the plates with iodine 
.vapour. The identity of the introduced mutations was confirmed 
by DNA sequencing • 

Variants BM5(F290K), BM6(F290A), BM8(Q360E) and BM11 (N102D) 
were constructed in a similar way. Details of their construction 
30 are given below. 

Variant: BM5(F290K) 

mutagenic oligonucleotide: bm5 (SEQ ID NO: 48) 
Primer (1st PCR): pi (SEQ ID NO: 52) 
35 Size of resulting fragment: 444 bp 
Primer (2nd PCR): p2 (SEQ ID NO: 53) 
Restriction endonucleases: HinDIII, Tthllll 
Size of cleaved fragment: 389 bp 



<WO 992321 1A1_I_> 



WO 99/23211 



57 



PCT/DK98/00471 - 



Variant: BM6 ( F290A) 

mutagenic oligonucleotide: bm6 (SEQ ID NO: 49) 
Primer (1st PCR) : pi (SEQ ID NO: 52) 
5 Size of resulting fragment: 444 bp 
Primer (2nd PCR) : p2 (SEQ ID NO: 53) 
Restriction endonucleases : HinDIII, Tthllll 
Size of cleaved fragment: 389 bp 

10 Variant: BM8 (Q360E) 

mutagenic oligonucleotide: bm8 (SEQ ID NO: 50) 
Primer (1st PCR) : pi (SEQ ID NO: 52) 
Size of resulting fragment : 230 bp 
Primer (2nd PCR) : p2 (SEQ ID NO: 53) 

15 Restriction endonucleases : HinDIII , Tthllll 
Size of cleaved fragment: 389 bp 



Variant: BM11 (N102D) 

mutagenic oligonucleotide: bmll (SEQ ID NO: 51) 
20 Primer (1st PCR): p3 (SEQ ID NO: 54) 
Size of resulting fragment: 577 
Primer (2nd PCR) : p4 (SEQ ID NO: 55) 
Restriction endonucleases: HinDIII, Pvul 
Size of cleaved fragment: 576 

25 

Mutagenic oligonucleotides : 

bm4 (SEQ ID NO: 47) : F290E 

primer 5' GTG TTT GAC GTC CCG 

bm5 (SEQ ID NO: 48): F290K 
30 primer 5 1 GTG TTT GAC GTC CCG 

bm6 (SEQ ID NO: 49): F290A 

primer 5 1 GTG TTT GAC GTC CCG 

bm8 (SEQ ID NO: 50): Q360E 

primer 5 ' AGG GAA TCC GGA TAC 
35 bmll (SEQ ID NO: 51) : N102D 

primer 5 1 GAT GTG GTT TTG 



CTT CAT GAG AAT TTA CAG G 
CTT CAT AAG AAT TTA CAG G 
CTT CAT GCC AAT TTA CAG G 
CCT GAG GTT TTC TAC GG 
GAT CAT AAG GCC GGC GCT GAT G 



Other primers : 
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pi: 5' C: TTA ~7A ATG C r G CCA AAC C (SEQ ID NO: 52) 
p2: '5' G GAA AAG AAA TGT TTA CGG TTG CG (SEQ ID NO: 53) 
p3: 5' G AAA TGA AGC GGA ACA TCA AAC ACG (SEQ ID NO: 54) 
p4 : 5' GTA TGA TTT AGG AGA ATT CC (SEQ ID NO: 55) 

Example 11 

^ -Amylase activity at alkaline pH of variants of parent BAN:1- 
300/Termamyl : 301-483 hybrid a -amylase. 

The measurements were made using solutions for the 
respective enzymes and utilizing the Phadebas assay (described 
above) f The activity was measured after incubating for 15 
minutes at 30°C in 50 mM Britton-Robinson buffer adjusted to the 
indicated pH by NaOH. 

NU/mg enzyme 

pH wt Q360E F290A F290K F290E 

8.0 5300 7800 8300 4200 6600 

9,0 1600 2700 3400 2100 1900 



N102D 

6200 

1900 



.992321 1A1_I_> 



WO 99/23211 
REFERENCES CITED 



PCT/DK98/00471- 



Klein, C, et al., Biochemistry 1992, 31, 8740-8746, 
Mizuno, H., et al., J. Mol. Biol. (1993) 234, 1282-1283, 

5 Chang, C, et al, J. Mol. Biol. (1993) 229, 235-238, 

Larson, S.B., J. Mol. Biol. (1994) 235, 1560-1584, 
; Lawson, C.L., J. Mol. Biol. (1994) 236, 590-600, 

Qian, M., et al., J . Mol. Biol. (1993) 231, 785-799, 

Brady, R.L., et al., Acta Crystallogr. sect. B, 47, 527-535, 

10 Swift, H.J., et al., Acta Crystallogr. sect. B, 47, 535-544 

A. Kadziola, Ph.D. Thesis: "An alpha-amylase from Barley and its 
Complex with a Substrate Analogue Inhibitor Studied by X-ray 
Crystallography", Department of Chemistry University of 
Copenhagen 1993 

15 MacGregor, E.A., Food Hydrocolloids , 1987, Vol.1, No. 5-6, p. 

B. Diderichsen and L. Christiansen, Cloning of a maltogenic a- 
amylase from Bacillus stearothermophilus, FEMS Microbiol, let- 
ters: 56: pp. 53-60 (1988) 

Hudson et al., Practical Immunology, Third edition (1989), 
20 Blackwell Scientific Publications, 

Sambrook et al., Molecular Cloning: A Laboratory Manual , 2nd 
Ed., Cold Spring Harbor, 1989 

S.L. Beaucage and M.H. Caruthers, Tetrahedron Letters 22 , 1981, 
pp. 1859-1869 

25 Matthes et al., The EMBQ J. 3, 1984, pp. 801-805. 
R.K. Saiki et al., Science 2 39 / 1988, pp. 487-491. 
Morinaga et al., (1984, Biotechnology 2:646-639) 

Nelson and Long, Analytical Biochemistry 180 , 1989, pp. 147-151 
Hunkapiller et al., 1984, Nature 310:105-111 
30 R. Higuchi, B. Krummel, and R.K. Saiki (1988). A general method 
of in vitro preparation and specific mutagenesis of DNA frag- 
ments: study of protein and DNA interactions. Nucl . Acids Res. 
16:7351-7367. 

Dubnau et al., 1971, J. Mol. Biol. 56 , pp. 209-221. 
35 Gryczan et al., 1978, J. Bacteriol. 134 , pp. 318-329. 

S.D. Erlich, 1977, Proc . Natl. Acad. Sci. 7_4, pp. 1680-1682. 
Boel et al., 1990, Biochemistry 29 , pp. 6244-6249. 



BNSDOCIO: <WO 9923211 A1_l_> 



WO 99/2321 1 PCT/DK98/00471 

60 



CLAIMS 



1. A variant of a parent Termamyl-like cc-amylase, which variant 
has a-amylase activity, said variant comprises one or more 
5 mutations corresponding to the following mutations in the amino 
acid sequence shown in SEQ ID NO: 2: 

T141, K142, F143, D144, F145, P146, G147, R148, G149, 
Q174, R181, G182, D183, G184, K185, A186, W189, S193, N195 
H107, K108, G109,D166, W167, D168, Q169, S170, R171, Q172, F173, 
10 F267, W268, K269, N270, D271, L272, G273, A274, L275, K311, 
E346, K385, G456, N457, K458,P459, G460, T461, V462, T463. 



2. The variant according to claim 1, which variant has one or 
more of the following substitutions or deletions: 



T141A, 


D, 


R, 


N, 


C,E,Q, 


G, H, 


I, 


L, 


K, 


M, F, P, 


S, W, Y, 


V 


K142A, 


D, 


R, 


N, 


C,E,Q, 


G, H, 


I, 


L, 


M, 


F,P,S, 


T,W, Y, 


V 


F143A, 


D, 


R, 


N, 


C, E, Q, 


G, H, 


I, 


L, 


K, 


M,P,S, 


T, W, Y, 


V 


D144A, 


R, 


N , 


c, 


E, Q, G, 


H, I, 


L, 


K, 


M, 


F,P,S, 


T,W, Y, 


V 


F145A, 


D, 


R, 


N, 


C,E,Q, 


G, H, 


I, 


L, 


K, 


M,P,S, 


T,W, Y, 


V 


P146A, 


D, 


R, 


N, 


C, E, Q, 


G, H, 


I, 


L, 


K, 


M,F,S, 


T, W, Y, 


V 


G147A, 


D, 


R/ 


N, 


C,E,Q, 


H,I, 


L, 


K, 


M, 


F,P,S, 


T,W, Y, 


V 


R148A, 


D, 


N, 


c, 


E,Q,G, 


H,I, 


L, 


K, 


M, 


F,P,S, 


T,W, Y, 


V 


G149A, 


D, 


R, 


N, 


C,E,Q, 


H,I, 


L, 


K, 


M# 


F,P,S, 


T, W, Y, 


V 


R181*, 


A, 


D, 


N, 


C,E,Q, 


G, H, 


I, 


L, 


K # 


M,F, P, 


S,T,W, 


Y 


G182*, 


A, 


D, 


R/ 


N,C,E, 


Q,H, 


I, 


L, 


K, 


M,F,P, 


S,T,W, 


Y 


D183*, 


A, 


R, 


N, 


■ C,E,Q, 


G, H, 


I, 


L, 


K, 


M,F,P, 


S,T,W, 


Y 


G184*, 


A, 


R, 


D, 


,N,C,E, 


Q,H, 


I, 


L, 


K, 


M,F,P, 


S,T,W, 


Y 


K185A, 


D, 


R, 


N, 


C,E,Q, 


G,H, 


I, 


L, 


M, 


F,P,S, 


T,W, Y, 


V 


A186D, 


R, 


N , 


c, 


.E,Q,G, 


H,I, 


L, 


K, 


M, 


F,P,S, 


T,W, Y, 


V 


W189A, 


D, 


R, 


N, 


C,E,Q, 


G,H, 


I, 


L, 


K, 


M, F, P, 


S,T, Y, 


V 


S193A, 


D, 


R/ 


N, 


C,E,Q, 


G, H, 


I, 


L, 


K, 


M,F,P, 


T,W, Y, 


V 


N195A, 


D, 


R, 


-C 


- E, Q, G, 


H, 1/ 


L, 


K, 


M, 


F,P,S, 


T,W, Y, 


V 


H107A, 


D, 


R< 


N, 


-C,E,Q, 


G,I, 


L, 


K< 


M, 


F,P,S, 


T,W, Y, 


V 


K108A, 


D, 


R, 


, N , 


-C,E,Q ( 


G,H, 


I, 


L, 


M, 


F,P,S, 


T, W, Y, 


V 


G109A, 


D, 


,R, 


, N, 


-C, E,Q ( 


H,I, 


L, 


K, 


M, 


F,P,S, 


T,W, Y, 


V 


D166A, 


R, 


. N, 


-C, 


- E, Q, G, 


H,I, 


L, 


K, 


M, 


F,P,S, 


T, W, Y, 


V 


W167A, 


D, 


,R, 


, N , 


■ C,E,Q, 


G, H, 


I, 


L, 


K, 


M, F, P, 


S,T, Y, 


V 


D168A, 


R, 


, N , 


,C 


-E,Q,G ( 


H, I, 


L, 


K, 


M, 


F, P,S, 


T, W, Y, 


V 
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Q169A, D,R,N,C,E,G, H, I , L , K , M , F , P , 


, S 


J 




Y 






S170A, D, R, N,C, E,Q, G, H, I, L, K,M, F, 


, P 


rT 


r W 


Y 


f v ( 




R171A, D f N,C, E,Q,G, H, I, L, K,M, F, P, 


- s 


J 


f w, 


Y 


t v, 




Q172A, D, R, N,C, E, G, H, I, L, K,M, F, P, 


,s 


rT 




Y 


r V, 


5 


F17 3A, D, R,N,C,E,Q,G,H, I,L,K,M, P, 


, s 


r T 


w, 


Y 


r V, 




Q174*, A, D, R, N,C, E, G, H, I, L, K,M, F, 


> p. 


r S t 


T, 


W 


r Y r 




F2'67A, D, R,N,C, E,Q,G,H, I,L,K,M, P, 


s. 




w # 


Y, 


r V; 




W2 68A, D, R, N, C, E,Q, G, H, I, L, K, M, F, 


P, 


s, 


T, 


Y, 


V; 




K2 69A, D, R, N, C, E,Q, G, H, I, L,M, F, P # 


s, 


T, 


w, 


Y, 


V; 


10 


N2 7 0A, D, R, C, E,Q,G, H, I, L, K,M, F, P, 


s, 


T, 


W, 


Y, 


V; 




D2 71A, R, N, C, E, Q,G, H, I, L, K,M, F, P, 


s, 


T, 


W, 


Y, 


V; 




L272A,D,R,N,C,E,Q,G,H / I / K,M f F,P / 


s, 


T, 


W, 


Y, 


V; 




G2 7 3A, D, R, N,C, E,Q, H, I, L, K, M, F, P, 


s, 


T, 


w, 


Y, 


V; 




A2 74D, R , N , C , E , Q , G , H , I , L , K , M , F , P , 


s, 


T, 


w, 


Y, 


V; 


15 


L2 7 5A, D,R,N,C,E,Q,G,H, I , K, M, F, P, 


s, 


T, 


w, 


Y, 


V; 




K311A, D,R,N,C,E,Q,G,H, I , L, M, F, P, 


s, 


T, 


w, 


Y, 


V; 




E34 6A, D,R,N,C, Q f G, H, I, K,L,M, F, P, 


s, 


T, 


w, 


Y, 


V; 




K38 5A, D, R, N,C, E,Q, G, H, I, L,M, F, P, 


s, 


T, 


w, 


Y, 


V; 




G4 56A, D,R,N,C,E,Q, H, I , L, K, M, F, P, 


s, 


T, 


w, 


Y, 


V; 


20 


N4 57A, D,R,C,E,Q,G,H, I , L, K, M, F, P, 


s, 


T, 


w, 


Y, 


V; 




K4 58A, D,R,N,C, E,Q,G,H, I,L,M, F, P, 


s, 


T, 


w, 


Y, 


V; 




P4 59A, D / R / N / C^E / Q / G f H / I / L / K,M, F, 


s, 


T, 


w, 


Y, 


V; 




G4 60A, D,R,N,C,E,Q,H, I , L, K, M, F, P, 


s, 


T, 


w, 


Y f 


V; 




T4 61A, D,R,N,C, E,Q,G,H, I,L,K,M, F, 


P, 


s, 


w., 


Y, 


V; 


25 


V4 62A, D,R,N,C, E,Q, G,H, I, L, K,M, F, 


P, 


s, 


T, 


■W, 


Y; 




T4 63A, D,R, N,C, E, Q, G, H, I , L, K, M, F, 


p, 


s, 


w, 


Y, 


V. 



3. The variant according to claim 2, wherein the variant has one 
or more of the following substitutions or deletions: 
30 K142R; S193P; N195F; K269R,Q; N270Y,R,D; K311R; E346Q; K385R; 
K458R; P459T; T461P; Q174*; R181Q,N,S; G182T,S,N; D183*; G184*; 
KISSA^R^D^CE^Q^CH^I^^^N^,?^,?,**, Y,V; A18 6T, S , N, I , V, R; 

W189T, S,N,Q. 

35 4. The variant according to claims 1-3, wherein the variant has 
a deletion in position D183 + G184, and further one or more of 
the following substitutions or deletions: K142R; S193P; N195F; 
K269R,Q; N270Y,R,D; K311R; E346Q; K385R; K458R; P459T; T461P; 



BNSDOCID: <WO 992321 1 A1 J_> 



WO 99/2321 1 PCT/DK98/00471 - 

Q174*; R181Q,N,S; G182T,S,W. 63* ; G184*; 

K185A,R, D,C, E,Q,G, H, I , L, M, N, F, P,S,T,W,Y,V; Al 86T , S , N , I , V, R; 

W18 9T,S,' ! 



5 5. The variant according to any of claims 1-4, wherein the 
variants exhibits an alteration in at least one of the following 
properties relative to the parent a-amylase: 

i) improved pH stability at a pH from 8 to 10.5; and/or 

ii) improved Ca 2+ stability at pH 8 to 10.5, and/or 

10 iii) increased specific activity at temperatures from 10 to 60°C, 
preferably 20-50°C, especially 30-40°C. 

6. The variant according to any of claims 1-5, exhibiting 
improved stability at pH 8 to 10.5, having mutations in one or 
15 more of the position (s) corresponding to the following positions 
(using SEQ ID NO: 2 numbering): T141, K142, F143, D144, F145, 
P146, G147, R148, G149, R181, A186, S193, N195, K269, N270, 
K311, K458, P459, T461. 



20 7. The variant according to claim 6, which variant has one or 

more of the following substitutions: 

T141A, D, R,N,C,E,Q,G,H, I, L, K, M, F, P, S, W, Y, V 

K142A, D,R,N,C,E,Q,G,H / I , L, M, F, P, S, T, W, Y, V 

F14 3A, D, R,N,C,E,Q,G,H, I,L,K,M, P,S,T,W,Y,V 
25 D14 4A, R,N,C,E,Q,G, H, I , L, K, M, F, P,S,T,W,Y,V 

F145A, D, R,N,C, E,Q,G,H, I,L,K,M, P,S,T,W,Y, V 

P14 6A, D,R,N,C,E,Q,G,H, I,L,K,M, F,S,T,W,Y, V 

G147A, D,R,N,C,E,Q,H,I,L,K,M,F, P,S,T,W,Y,V 

R148A, D,N,C,E,Q,G, H, I,L,K,M, F, P, S, T, W, Y, V 
30 G14 9A,D / R,N,C,E,Q / H,I,L,K,M,F / P,S,T,W,Y,V 

K181A, D,R,N,C,E,Q,G,H,I,L,M,F,P,S,T,W,Y,V 

A18 6D, R,N,C,E,Q,G,H, I,L, P,K,M, F,S,T,W,Y, V 

S193A, D, R,N,C, E, Q, G, H, I, L, K, M, F, P,T,W, Y, V 

N195A, D,R,C,E,Q,G,H,I,L,K,M,F,P,S,T,W,Y,V 
35 K2 69A, D,R,N,C,E,Q,G,H,I,L,M,F,P,S,T,W,Y,V 

N270A, D, R,C,E,Q,G,H, I,L, K,M, F, P, S,T,W, Y, V 

K311A, D,R,N,C, E,Q,G,H, I, L,M, F, P, S,T,W, Y, V 
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K458A, D,R,N,C,E,Q,G / H,I,L,M,F,P,S,T,W,Y,V; 
P459A, D,R,N,C,E,Q,G,H,I,L,K,M, F,S,T,W,Y,V; 
T4 61A,D,R,N,C,E,Q,G,H,I,L,K,M,F / P,S,W,Y,V. 

5 8. The variant according to claim 7, wherein the variant has one 
..or more of the following substitutions: K142R, R181S, A186T, 
S193P, N195F, K269R, N270Y, K311R, K458R, P459T and T461P. 



10 



15 



20 



30 



9. The variant according to claims 1-5, exhibiting improved Ca 2t 
stability at pH 8 to 10.5, having mutations in one or more of 
the following positions (using the SEQ ID NO: 2 numbering): 
R181, G182, D183, G184, K185, A186, W189, N195, N270, E346, 
K385, K458, P459. 



10. The variant according to claim 9, which variant has one or 
more of the following substitutions or deletions: 
R181*,A, D,N,C,E,Q,G,H,I,L,K,M,F, P,S,T,W,Y,V; 
G182*,A, D, R,N,C,E,Q,H, I,L,K,M, F, P, S,T,W, Y, V; 
D183*,A, R,N,C,E,Q,G,H, I,L,K,M, F, P, S,T,W,Y,V; 
G184*,A, R, D, N, C, E, Q, H, I,L,K,M, F, P, S,T,W, Y, V; 
K185A, D, R,N,C,E,Q,G,H, I,L,M, F, P, S, T, W, Y, V; 
A186D, R/N,C / E,Q,G / H / I,L,K / M,F,P,S / T,W,Y,V; 
W189A,D, R,N,C,E,Q,G,H, I , L, K, M, F, P, S, T, Y, V; 
N195A, D, R, C, E,Q,G, H, I, L, K,M/ F, P, S, T, W, Y,V; 
25 N270A, RfD/N,C,E / Q,H,I,L,K,M / F / P,S,T,W,Y,V; 
E346A, RfD,N,C,Q,G,H / I,L,K / M,F,P,S / T,W / Y / V; 
K385A,R, D,N,C,E,Q,G,H, I, L,M, F, P,S, T,W,Y, V; 
K458A,R, D.lirCErQ^^,!,!,^,?,?^,?,^^^; 
P4 59A,R, D,N f C / E,Q,G,H,I,L / K / M,F,S / T / W f Y,V. 



11. The variant according to claim 10, wherein the variant has 
one or more of the following substitutions or deletions: 
R181Q,N; G182T,S,N; D183*; G184*; 

K185A,R,D,C,E,Q,G,H,I,L,M,N,F,P,S,T,W,Y,V; A18 6T, S, N, I, V; 
35 W189T,S,N,Q; N195F; N270R, D; E346Q; K385R; K458R; P459T.' 

12. A variant according to claims 1-11, wherein the parent 
Termamyl-like oc-amylase is selected from: 
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the Bacillus strain NCIB 12512 a-amylase having the sequence 
shown in SEQ ID NO: 1; 

the S. amyloliquefaciens a-amylase having the sequence shown in 
SEQ ID NO: 5; 

5 the B. licheniformis a-amylase having the sequence shown in SEQ 
ID NO: 4 . 

13. The variant according to claims 1-5, exhibiting increased 
specific activity at a temperatures from 10 to 60°C, preferably 

10 20-50°C, especially 30-40°C, having mutation (s) in one or more of 
the following positions (using the SEQ ID NO: 2 numbering): 
H107, K108, G109, D166, W167, D168, Q169, S170, R171, Q172, 
F173, Q174, D183, G184, N195, F267, W268, K269,N270, D271, L272, 
G273, A274, L275, G456, N457, K458, P459, G460, T461, V462, 

15 T463. 

14. The variant according to claim 13, which variant has one or 
more of the following substitutions: 



H107A, D, R, 


N,C, 


E, 


Q,G, I, 


L, 


K, 


M, 


F, P, 


s, 


T, W, Y, V 


K108A, D, R, 


N,C, 


E, 


Q,G,H, 


I, 


L, 


M, 


F,P, 


s, 


T, W, Y, V 


G109A, D, R, 


N,C, 


E ( 


Q,H, I. 


L, 


K, 


M, 


F,P, 


s, 


T,W, Y,V 


D166A,R,N, 


C,E, 


Q, 


G,H, I, 


L, 


K, 


M, 


F, P, 


s, 


T, W, Y, V 


W167A, D, R, 


N,C, 


E, 


Q,G,H, 


Ir 


L, 


K, 


M/ F, 


p, 


S,T, Y,V 


D168A,R,N, 


C,E, 


Q, 


G,H, I, 


L, 


K, 


M, 


F,P, 


s, 


T, W, Y,V 


Q169A, D, R, 


N,C, 


E, 


G,H,I, 


L, 


K, 


M, 


F, P, 


s, 


T,W,Y,V 


S170A,D,R, 


N,C, 


E, 


Q,G,H, 


I, 


L, 


K, 


M,F, 


p, 


T,W, Y,V 


R171A, D, N, 


C,E, 


Q. 


G,H,I, 


L, 


K, 


M, 


F,P, 


s, 


T, W, Y, V 


Q172A, D, R, 


N,C, 


E, 


G,H,I, 


L, 


K, 


M, 


F,P, 


s, 


T, W, Y, V 


F173A, D, R, 


N,C, 


E, 


Q,G,H ( 


I > 


L, 


- K, 


M,P, 


s, 


T, W, Y, V 


Q174*,A, D, 


R,N, 


,C, 


E, G, H, 


■ I, 


L, 


, K, 


M,F, 


p, 


S, T, W, Y 


D183*,A, D, 


R, N, 


c, 


E, Q, G, 


-H, 


I, 


L, 


K,M,F, 


P,S,W,Y 


G184 * , A, R, 


N,C, 


E, 


r Q» G, H, 


I, 


L, 


K, 


M, F, 


p, 


S, T, W, Y 


N195A, D, R, 


C,E, 


rQ, 


-G,H,I, 


-L, 


K, 


-M, 


F,P, 


s, 


T,W, Y,V 


F267A, D, R, 


,N,C, 


- E, 


-Q,G,H, 


- I, 


L, 


, K, 


M,P, 


s, 


T, W, Y, V 


W268A, D, R, 


N,C 


- E, 


,Q.G,H, 


-I, 


L, 


- K, 


M, F, 


p, 


S, T, Y, V 


K269A, D, R, 


N,C, 


,E ( 


, Q, G, H, 


-I, 


L, 


-M, 


-F,P, 


s, 


T,W,Y,V 


N270A, D, R, 


,C, E 


rQ, 


,G,H,I, 


r L, 


K, 


, M, 


F,P, 


s, 


T,W, Y,V 
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D271A,R,N,C,E,Q,G,H,I,L,K,M, F, P, S, T, W, Y, V; 
L272A, D, R,N,C,E,Q,G, H, I, K # M, F, P, S, T, W, Y, V; 
G273A, D, R,N,C,E,Q,H, I,L, K, M, F, P, S, T, W, Y, V; 
A274D,R,N,C,E,Q,G,H, I,L, K, M, F, P, S, T, W, Y, V; 
5 L27 5A, D,R,N,C,E,Q,G,H, I, K,M, F, P, S, T, W, Y, V; 
G4 56A, D,R,N,C,E,Q,H, I,L,K,M, F, P, S,T,W, Y,V; 
N457A, D,R,C,E,Q,G,H,I,L,K,M,F / P,S,T,W,Y,V; 
K4 58A, D,R,N,C,E,Q,G,H, I,L,M, F, P, S , T , W , Y , V ; 
P45 9A,D,R,N,C,E,Q,G,H,I,L,K,M,F,S,T,W,Y,V; 
10 G4 60A,D,R,N,C,E,Q,H,I,L,K,M, F, P, S, T, W, Y, V; 
T4 61A, D, R,N,C, E,Q,G, H, I, L, K, M, F, P, S,W, Y, V; 
V4 62A, D,R,N,C,E,Q,G,H, I , L , K , M , F , P,S,T,W,Y; 
T4 63A, D,R,N,C,E,Q,G,H, I,L,K,M, F, P, S,W,Y,V. 

15 15. The variant according to claim 14, wherein the variant has 
one or more of the following substitutions or deletions: 
Q174*, D183*, G184*, N195F, K269S. 

16. The variant according to claims 13-15, wherein the parent 
Termamyl-like a-amylase is the B. licheniformis a-amylase having 
the sequence shown in SEQ ID NO: 4. 

17. A DNA construct comprising a DNA sequence encoding an a- 
amylase variant according to any one of claims 1-16. 

18. A recombinant expression vector which carries a DNA con- 
struct according to claim 17. 

19. A cell which is transformed with a DNA construct according 
30 to claim 17 or a vector according to claim 18. 

20. A cell according to claim 19, which is a microorganism. 

21. A cell according to claim 20, which is a bacterium or a 
35 fungus . 

22. The cell according to claim 21, which is a Gram positive 
bacterium such as Bacillus subtilis, Bacillus licheniformis, 



20 



25 
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Bacillus lentus, Bacillus brevis, Bacillus stearothermophi lus , 
Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus 
coagulans f Bacillus circulans, Bacillus lautus or Bacillus thu- 
ringiensis . 

5 

23. Use of an a-amylase variant according to any one of claims 
1-16 for washing and/or dishwashing. 

24. A detergent additive comprising an a-amylase variant accor- 
10 ding to any one of claims 1-16, optionally in the form of a non- 
dusting granulate, stabilized liquid or protected enzyme. 

25. A detergent additive according to claim 24 which contains 
0.02-200 mg of enzyme protein/g of the additive. 

15 

26. A detergent additive according to claims 24 or 25, which 
additionally comprises another enzyme such as a protease, a 
lipase, a peroxidase, another amylolytic enzyme and/or a 
cellulase . 

20 

27. A detergent composition comprising an a-amylase. variant ac- 
cording to any of claims 1-16. 

28. A detergent composition according to claim 27 which addi- 
25 . tionally comprises another enzyme such as a protease, a lipase, 

a peroxidase, another amylolytic enzyme and/or a cellulase. 

29. A manual or automatic dishwashing detergent composition 
comprising an a-amylase variant according to any one of claims 

30 1-16. 

30. A dishwashing detergent composition according to claim 29 
which additionally comprises another enzyme such as a protease, 
a lipase, a peroxidase, another amylolytic enzyme and/or a 

35 cellulase. 

31. A manual or automatic laundry washing composition comprising 
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an a-amylase variant according to any one of claims 1-16. 

32. A laundry washing composition according to claim 31, which 
additionally comprises another enzyme such as a protease, a 

5 lipase, a peroxidase, an amylolytic enzyme and/or a cellulase. 

33. Method for providing oc-amylases with 

1) altered pH optimum, and/or 

2) altered temperature optimum, and/or 
10 3) improved stability, 

comprising the following steps: 

i) identifying (a) target position (s) and/or region (s) for 
mutation of the a-amylase by comparing the molecular dynamics of 
two or more a-amylase' s 3D structures having substantially 

15 different pH, temperature and/or stability profiles, 

ii) substituting, adding and/or deleting one or more amino acids 
in the identified position(s) and/or region(s). 

34. The method according to claim 33, wherein a medium 
20 temperature a-amylase is compared with a high temperature a- 

amylase. 

35. The method according to claim 33, wherein a low temperature 
a-amylase is compared with a medium or high temperature a- 

25 amylase. 

36. The method according to claims 33-35, wherein the a-amylases 
are at least 70%, preferably 80%, up to 90%, such as up to 95%, 
especially 95% homologous. 

30 

37. The method according to claim 36, wherein the a-amylases 
compared are Termamyl-like a-amylases. 

38. The method according to claim 28, wherein the a-amylases 

35 compared are any of the a-amylases shown in SEQ ID NO: 1 to SEQ 
ID NO: 8. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 
(i) APPLICANT: 
5 (A) NAME: NOVO NORDISK A/S 

(B) STREET: Novo Alle 
■(C) CITY: DK-2880 Bagsvaerd 
{ E ) COUNTRY: Denmark 
(F) POSTAL CODE (ZIP): DK-2880 
10 (G) TELEPHONE: + 45 44 44 88 88 

(H) TELEFAX: +45 44 49 32 56 
- (ii) TITLE OF INVENTION: a-amylase mutants 
(iii) NUMBER OF SEQUENCES: 4 6 
(iv) COMPUTER READABLE FORM: 
15 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS/MS-DOS 

(D) SOFTWARE: Patentln Release •# 1 . 0 , Version #1.25 (EPO) 

20 (2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
25 ( D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
30 1 5 10 15 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 
20 25 30 

35 Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie Pro Pro Ala Trp 

35 40 45 



40 



55 



Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 

65 70 75 80 



Thr Arg Asn Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
45 85 90 95 

He Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 110 

50 Gly Thr Glu He Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn 

115 120 125 



Gin Glu Thr Ser Gly Glu Tyr Ala He Glu Ala Trp Thr Lys Phe Asp 

130 135 140 

Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 

145 150 155 160 
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His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Gin Leu Gin Asn Lys 
165 170 175 

lie Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 . -185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 205 

Asp His Pro Glu Val lie His Glu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
225 230 235 240 

lie Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 
245 250 255 

Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 270 

Gly Ala lie Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 

Gly Tyr Tyr Asp Met Arg Asn lie Leu Asn Gly Ser Val Val Gin Lys 
305 310 315 320 

His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 

Gly Glu Ala Leu Glu Ser Phe Val Gin Gin Trp Phe Lys Pro Leu Ala 
340 345 350 

Tyr Ala Leu Val Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
355 360 365 

Gly Asp Tyr Tyr Gly He Pro Thr His Gly Val Pro Ala Met Lys Ser 
370 375 380 

Lys He Asp Pro Leu Leu Gin Ala Arg Gin Thr Phe Ala Tyr Gly Thr 
385 390 395 400 

Gin His Asp Tyr Phe Asp His His Asp He He Gly Trp Thr Arg Glu 
405 410 415 

Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr lie Met Ser Asp 
420 425 430 



Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly 
435 440 445 



Gin Val Trp Arg Asp He Thr Gly Asn Arg Thr Gly Thr Val Thr He 
450 455 460 
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Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
465 470 475 480 

Val Trp Val Lys Gin 
5 485 

(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



15 



30 



45 



His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp His 
1 5 10 15 



Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ser 
20 20 25 30 

Asn Leu Arg Asn Arg Gly lie Thr Ala lie Trp lie Pro Pro Ala Trp 
35 40 45 

25 Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 

50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 



Thr Arg Ser Gin Leu Glu Ser Ala lie His Ala Leu Lys Asn Asn Gly 
85 90 95 



Val Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
35 100 105 110 

Ala Thr Glu Asn Val Leu Ala Val Glu Val Asn Pro Asn Asn Arg Asn 
115 120 125 

40 Gin Glu lie Ser Gly Asp Tyr Thr lie Glu Ala Trp Thr Lys Phe Asp 

130 * 135 140 

Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp Arg Trp Tyr 
145 150 155 160 



His Phe Asp Gly Val Asp Trp Asp Gin Ser Arg Gin Phe Gin Asn Arg 
165 170 175 



lie Tyr Lys Phe Arg Gly Asp Gly Lys Ala Trp Asp Trp Glu Val Asp 
50 180 185 190 

Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 205 

55 Asp His Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tyr 

210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
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225 230 235 . 240 

He Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Ala 

245 250 255 

5 

Thr Gly Lys Glu Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 

260 265 270 

Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val 

10 275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 

290 t 295 300 

15 Gly Asn Tyr Asp Met Ala Lys Leu Leu Asn Gly Thr Val Val Gin Lys 

305 310 315 320 



20 



35 



His Pro Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 

Gly Glu Ser Leu Glu Ser Phe Val Gin Glu Trp Phe Lys Pro Leu Ala 
340 345 350 



Tyr Ala Leu He Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
25 355 360 365 

Gly Asp Tyr Tyr Gly He Pro Thr His Ser Val Pro Ala Met Lys Ala 
370 375 380 

30 Lys He Asp Pro lie Leu Glu Ala Arg Gin Asn Phe Ala Tyr Gly Thr 

385 390 395 400 

Gin His Asp Tyr Phe Asp His His Asn He lie Gly Trp Thr Arg Glu 
405 410 415 



Gly Asn Thr Thr His Pro Asn Ser Gly Leu Ala Thr He Met Ser Asp 
420 425 430 



Gly Pro Gly Gly Glu Lys Trp Met tyr Val Gly Gin Asn Lys Ala Gly 
40 435 440 445 

Gin Val Trp His Asp lie Thr Gly Asn Lys Pro Gly Thr Val Thr lie 
450 455 460 

45 Asn Ala Asp Gly Trp Ala Asn Phe Ser Val Asn Gly Gly Ser Val Ser 

465 470 475 480 

lie Trp Val Lys Arg 
485 

50 

(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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Ala Ala Pro Phe Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr Leu 
15 10 15 

Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn 
20 25 30 

Leu Ser Ser Leu Gly He Thr Ala Leu Trp Leu Pro Pro Ala Tyr. Lys 
35 40 45 

Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp 
50 55 60 

Leu Gly Giu Phe Asn Gin Lys Gly Ala Val Arg Thr Lys Tyr Gly Thr 
65 70 75 80 

Lys Ala Gin Tyr Leu Gin Ala He Gin Ala Ala His Ala Ala Gly Met 
85 90 95 

Gin Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly 
100 105 110 

Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gin 
115 120 125 

Glu He Ser Gly Thr Tyr Gin He Gin Ala Trp Thr Lys Phe Asp Phe 
130 135 . 140 

Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His 
145 150 155 160 

Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg He Tyr 
165 170 175 

Lys Phe Arg Gly He Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu 
180 185 190 

Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His 
195 200 205 

Pro Glu Val Val Thr Glu Leu Lys Ser Trp Gly Lys Trp Tyr Val Asn 
210 215 220 

Thr Thr Asn He Asp Gly Phe Arg Leu Asp Ala Val Lys His He Lys 
225 230 235 " 240 

Phe Ser Phe Phe Pro Asp Trp Leu Ser Asp Val Arg Ser Gin Thr Gly 
245 250 255 

Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp He Asn Lys 
260 265 270 



Leu His Asn Tyr He Met Lys Thr Asn Gly Thr Met Ser Leu Phe Asp 
275 280 285 



Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Thr 
290 295 300 
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Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gin Pro 
305 310 315 320 

Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gin 
325 330 335 

Ala Leu Gin Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala 
340 345 350 

Phe lie Leu Thr Arg Gin Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp 
355 360 365 

Tyr Tyr Gly lie Pro Gin Tyr Asn lie Pro Ser Leu Lys Ser Lys lie 
370 375 380 

Asp Pro Leu Leu lie Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gin His 
385 390 395 400 

Asp Tyr Leu Asp His Ser Asp lie lie Gly Trp Thr Arg Glu Gly Val 
405 410 415 

Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu lie Thr Asp Gly Pro 
420 425 430 

Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gin His Ala Gly Lys Val 
435 440 445 

Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr lie Asn Ser 
450 455 460 

Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp 
465 470 475 480 

Val Pro Arg Lys Thr Thr Val Ser Thr lie Ala Trp Ser He Thr Thr 
485 490 495 

Arg Pro Trp Thr Asp Glu Phe Val Arg Trp Thr Glu Pro Arg Leu Val 
500 505 510 



Ala Trp 



(2) INFORMATION FOR SEQ ID NO; 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ala Asn Leu Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Met Pro 
1 "5 10 15 

Asn Asp Gly Gin His Trp Arg Arg Leu Gin Asn Asp Ser Ala Tyr Leu 
20 25 30 

Ala Glu His Gly He Thr Ala Val Trp He Pro Pro Ala Tyr Lys Gly 
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35 40 45 

Thr Ser Gin Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu 
50 55 60 

5 

Gly Glu Phe His Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys 
65 70 75 80 

Gly Glu Leu Gin Ser Ala lie Lys Ser Leu His Ser Arg Asp lie Asn 
10 .85 90 * 95 

Val Tyr Gly Asp Val Val He Asn His Lys Gly Gly Ala Asp Ala Thr 
100 105 ' 110 

15 Glu Asp Val Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val 

115 120 125 



20 



35 



50 



He Ser Gly Glu His Leu He Lys Ala Trp Thr His Phe His Phe Pro 
130 135 140 

Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe 

145 150 155 160 



Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg He Tyr Lys 
25 165 170 175 

Phe Gin Gly Lys Ala Trp Asp Trp Glu Val Ser Asn Glu Asn Gly Asn 
180 185 190 

30 Tyr Asp Tyr Leu Met Tyr Ala Asp He Asp Tyr Asp His Pro Asp Val 

195 200 * 205 

Ala Ala Glu He Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gin 
210 215 220 



Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His lie Lys Phe Ser Phe 
225 230 235 240 



Leu Arg Asp Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met 

40 245 250 255 

Phe Thr Val Ala Glu Tyr Trp Gin Asn Asp Leu Gly Ala Leu Glu Asn 

260 265 270 

45 Tyr Leu Asn Lys Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu 

275 280 285 



His Tyr Gin Phe His Ala Ala Ser Thr Gin Gly Gly Gly Tyr Asp Met 

290 295 300 

Arg Lys Leu Leu Asn Gly Thr Val Val Ser Lys His Pro Leu Lys Ser 

305 310 315 320 



Val Thr Phe Val Asp Asn His Asp Thr Gin Pro Gly Gin Ser Leu Glu 

55 325 330 335 

Ser Thr Val Gin Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu 

340 345 350 
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Thr Arg Glu Ser 

Thr Lys Gly Asp 
'370 

Glu Pro lie Leu 
385 

Asp Tyr Phe Asp 



Ser Ser Val Ala 
420 

Gly Gly Ala Lys 
435 

Trp His Asp lie 
450 

Glu Gly Trp Gly 
465 

Val Gin Arg 



Gly . Pro 



Ser Gin Arg Glu 
375 

Lys Ala Arg Lys 
390 

His His Asp lie 
405 

Asn Ser Gly Leu 



Arg Met Tyr Val 
440 

Thr Gly Asn Arg 
455 

Glu Phe His Val 
470 



8 

Val Phe Tyr Gly 



lie Pro Ala Leu 
380 

Gin Tyr Ala Tyr 
395 

Val Gly Trp Thr 
410 

Ala Ala Leu lie 
425 

Gly Arg Gin Asn 



Ser Glu Pro Val 
460 

Asn Gly Gly Ser 
475 



Asp Met Tyr Gly 
365 

Lys His Lys lie . 



Gly Ala Gin His 
400 

Arg Glu Gly Asp 
415 

Thr Asp Gly Pro 
430 

Ala Gly Glu Thr 
445 

Val lie Asn Ser 



Val Ser lie Tyr 
480 



) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 80 amino acids 
"(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Val Asn Gly Thr Leu Met Gin Tyr 
1 5 

Gly Gin His Trp Lys Arg Leu Gin 
20 

He Gly He Thr Ala Val Trp He 
35 40 

Gin Ser Asp Asn Gly Tyr Gly Pro 
50 55 



Phe Glu Trp Tyr Thr Pro Asn Asp 
10 15 

Asn Asp Ala Glu His Leu Ser Asp 
25 30 

Pro Pro Ala Tyr Lys Gly Leu Ser 
45 

Tyr Asp Leu Tyr Asp Leu Gly Glu 
60 



Phe Gin Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ser Glu 
65 70 75 80 

Leu Gin Asp Ala He Gly Ser Leu His Ser Arg Asn Val Gin Val Tyr 
85 90 95 



Gly Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala Thr Glu Asp 
100 105 ' no 
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Val Thr Ala Val Glu Val Asn Pro Ala Asn Arg Asn Gin Glu Thr Ser 
115 120 125 

Glu Glu Tyr Gin lie Lys Ala Trp Thr Asp Phe Arg Phe Pro Gly Arg 
130 135 140 

Gly Asn Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly 
145 150 155 160 



10 



Ala Asp Trp Asp Glu Ser Arg Lys lie Ser Arg lie Phe Lys Phe Arg 
165 170 175 



15 



Gly Glu Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn 

180 185 190 

Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro Asp Val 

195 200 205 



20 



Val Ala Glu Thr Lys Lys Trp Gly lie Trp Tyr Ala Asn Glu Leu Ser 
210 215 220 



Leu Asp Gly Phe Arg lie Asp Ala Ala Lys His lie Lys Phe Ser Phe 
225 230 235 240 



25 



Leu Arg Asp Trp Val Gin Ala Val Arg Gin Ala Thr Gly Lys Glu Met 
245 250 255 



30 



Phe Thr Val Ala Glu Tyr Trp Gin Asn Asn Ala Gly Lys Leu Glu Asn 
260 265 270 

Tyr Leu Asn Lys Thr Ser Phe Asn Gin Ser Val Phe Asp Val Pro Leu 
275 280 285 



35 



His Phe Asn Leu Gin Ala Ala Ser Ser Gin Gly Gly Gly Tyr Asp Met 
290 295 300 



Arg Arg Leu Leu Asp Gly Thr Val Val Ser Arg His Pro Glu Lys Ala 
305 310 315 320 



40 



Val Thr Phe Val Glu Asn His Asp Thr Gin Pro Gly Gin Ser Leu Glu 
325 330 335 



45 



Ser Thr Val Gin Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe lie Leu 
340 345 350 

Thr Arg Glu Ser Gly Tyr Pro Gin Val Phe Tyr Gly Asp Met Tyr Gly 
355 360 365 



50 



Thr Lys Gly Thr Ser Pro Lys Glu lie Pro Ser Leu Lys Asp Asn lie 
370 375 380 



Glu Pro lie Leu Lys Ala Arg Lys Glu Tyr Ala Tyr Gly Pro Gin His 
385 390 395 400 



55 



Asp Tyr lie Asp His Pro Asp Val lie Gly Trp Thr Arg Glu Gly Asp 
405 410 415 



Ser Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu lie Thr Asp Gly Pro 
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4 20 

Gly Gly Ser Lys Arg 
435 

Trp Tyr Asp lie Thr 
450 

Asp Gly Trp Gly Glu 
465 



10 
425 

Met Tyr Ala Gly Leu 
440 

Gly Asn Arg Ser Asp 
455 

Phe His Val Asn Asp 
470 



430 

Lys Asn Ala Gly Glu Thr 
445 

Thr Val Lys lie Gly Ser 
460 

Gly Ser Val Ser lie Tyr 
475 480 



INFORMATION FOR SEQ I D NO : 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ I D NO : 6: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
15 10 15 



Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Asn Ser Asp Ala Ser 
20 25 30 

Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie 'Pro Pro Ala Trp 
35 40 45 

Lys Gly Ala Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Ser Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
85 90 95 

lie Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 HO 

Ala Thr Glu Met Val Arg Ala Val Glu Val Asn Pro Asn Asn Arg Asn 
115 120 125 

Gin Glu Val Thr Gly Glu Tyr Thr He Glu Ala TrD Thr Arg Phe Asp 
130 135 140 

Phe Pro Gly Arg Gly Asn Thr His Ser Ser Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Val Asp Trp Asp Gin Ser Arg Arg Leu Asn Asn Arg 
165 170 175 



He Tyr Lys Phe Arg Gly His Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 185 190 
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10 



25 



40 



55 



Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp He Asp Met 
195 200 205 

Asp His Pro Glu Val Val Asn Glu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 

Thr Asn Thr Leu Gly Leu Asp Gly Phe Arg He Asp Ala Val Lys His 
225 230 235 240 

He Lys Tyr Ser Phe Thr Arg Asp Trp He Asn His Val Arg Ser Ala 
245 250 255 



Thr Gly Lys Asn Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 

15 260 . 265 270 

Gly Ala He Glu Asn Tyr Leu Gin Lys Thr Asn Trp Asn His Ser Val 
275 280 285 

20 Phe Asp Val Pro . Leu His Tyr Asn Leu Tyr Asn Ala Ser Lys Ser Gly 

290 295 300 



Gly Asn Tyr Asp Met Arg Asn lie Phe Asn Gly Thr Val Val Gin Arg 

305 310 315 320 

His Pro Ser His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 

325 330 335 



Glu Glu Ala Leu Glu Ser Phe Val Glu Glu Trp Phe Lys Pro Leu Ala 
30 \ 340 34 5 350 

Tyr Ala Leu Thr Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
: 355 360 365 

35 Gly Asp Tyr Tyr Gly He Pro Thr His Gly Val Pro Ala Met Arg Ser 

370 375 380 

x,ys He Asp Pro He Leu Glu Ala Arg Gin Lys Tyr Ala Tyr Gly Lys 
385 390 395 400 



Gin Asn Asp Tyr Leu Asp His His Asn He He Gly Trp Thr Arg Glu 
405 410 415 



Gly Asn Thr Ala His Pro Asn Ser Gly Leu Ala Thr He Met Ser Asp 
45 420 425 430 

Gly Ala Gly Gly Ser Lys Trp Met Phe Val Gly Arg Asn Lys Ala Gly 
435 440 445 

50 Gin Val Trp Ser Asp He Thr Gly Asn Arg Thr Gly Thr Val Thr He 

450 455 460 



Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
465 470 475 480 

He Trp Val Asn Lys 
485 
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10 



25 



40 



55 



12 



(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
15 10 15 



Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 
15 20 25 30 

Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie. Pro Pro Ala Trp 
35 40 45 

20 Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 

50 55 60 



Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Asn Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
85 90 95 



lie Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
30 100 105 110 

Gly Thr Glu lie Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn 
115 120 125 

35 Gin Glu Thr Ser Gly Glu Tyr Ala lie Glu Ala Trp Thr Lys Phe Asp 

130 135 140 



Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Gin Leu Gin Asn Lys 
165 170 175 



He Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp 
45 180 185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 

195 200 205 

50 Asp His Pro Glu Val He His Glu Leu Arg Asn Trp Gly Val Trp Tyr 

210 215 220 



Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg He Asp Ala Val Lys His 
225 230 235 240 

He Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 
245 250 255 
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Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu ' 

260 265 270 

Gly Ala He Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val 

275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 

290 295 300 



10 



15 



Gly Tyr Tyr Asp Met Arg Asn He Leu Asn Gly Ser Val Val Gin Lys 

305 310 315 320 

His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 

Gly Glu Ala Leu Glu Ser Phe Val Gin Gin Trp Phe Lys Pro Leu Ala 
340 345 350 



20 



Tyr Ala Leu Val Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
355 360 365 



Gly Asp Tyr Tyr Gly He Pro Thr His Gly Val Pro Ala Met Lys Ser 
370 375 380 



25 



30 



Lys He Asp Pro Leu Leu Gin Ala Arg Gin Thr Phe Ala Tyr Gly Thr 
385 390 395 400 

Gin His Asp Tyr Phe Asp His His Asp He He Gly Trp Thr Arg Glu 
405 410 415 

Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr He Met Ser Asp 
420 425 430 



35 



Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly 
435 440 445 



40 



45 



50 



Gin Val Trp Arg Asp He Thr Gly Asn Arg Thr Gly Thr Val Thr He 
450 455 460 

Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
465 470 475 480 

Val Trp Val Lys Gin 
485 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



55 



His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp His 



10 



15 



Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala S 



er 
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25 



30 



Asn Leu Arg Asn Arg Gly lie Thr Ala He Trp He Pro Pro Ala Trp 
35 40 45 

Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Ser Gin Leu Glu Ser Ala He His Ala Leu Lys Asn Asn Gly 
85 90 95 

Val Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 110 

Ala Thr Glu Asn Val Leu Ala Val Glu Val Asn Pro Asn Asn Arg Asn 
115 120 125 

Gin Glu He Ser Gly Asp Tyr Thr He Glu Ala Trp Thr Lys Phe Asp 
130 135 140 

Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Val Asp Trp Asp Gin Ser Arg Gin Phe Gin Asn Arg 
165 170 175 

lie Tyr Lys Phe Arg Gly Asp Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 185 190 

Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Aia Asp Val Asp Met 
195 200 205 

Asp His Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tyr 
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg He Asp Ala Val Lys His 
225 230 235 240 

He Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Ala 
245 250 255 

Thr Gly Lys Glu Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 " 270 

Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 

Gly Asn Tyr Asp Met Ala Lys Leu Leu Asn Gly Thr Val Val Gin Lys 
305 310 315 320 



His Pro Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 



1A1 I > 



10 



25 
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Gly Glu Ser Leu Glu Ser Phe Val Gin Glu Trp Phe Lys Pro Leu Ala 

340 345 350 

Tyr Ala Leu "Tie Eeii" Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 

355 .:"3 60 365 

Gly Asp Tyr Tyr Gly He Pro Thr His Ser Val Pro Ala Met Lys Ala 
370 375 380 

Lys He Asp Pro He Leu Glu Ala Arg Gin Asn Phe Ala Tyr Gly Thr 
385 390 395 400 



Gin His Asp Tyr Phe Asp His His Asn He lie Gly Trp Thr Arg Glu 
15 405 410 415 

Gly Asn Thr Thr His Pro Asn Ser Gly Leu Ala Thr He Met Ser Asp 
420 425 430 

20 Gly Pro Gly Gly Glu Lys Trp Met Tyr Val Gly Gin Asn Lys Ala Gly 

435 440 . 445 



Gin Val Trp His Asp He Thr Gly Asn Lys Pro Gly Thr Val Thr lie 

450 455 460 

Asn Ala Asp Gly Trp Ala Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
465 470 475 480 



He Trp Val Lys Arg 
30 : 485 

(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

40 

CATCATAATG GAACAAATGG TACTATGATG CAATATTTCG AATGGTATTT GCCAAATGAC 60 

GGGAATCATT GGAACAGGTT GAGGGATGAC GCAGCTAACT TAAAGAGTAA AGGGATAACA 120 

45 GCTGTATGGA TCCCACCTGC ATGGAAGGGG ACTTCCCAGA ATGATGTAGG TTATGGAGCC 180 

TATGATTTAT ATGATCTTGG AGAGTTTAAC CAGAAGGGGA CGGTTCGTAC AAAATATGGA 24 0 

ACACGCAACC AGCTACAGGC TGCGGTGACC TCTTTAAAAA ATAACGGCAT TCAGGTATAT 300 

50 

GGTGATGTCG TCATGAATCA TAAAGGTGGA GCAGATGGTA CGGAAATTGT AAATGCGGTA 360 

GAAGTGAATC GGAGCAACCG AAACCAGGAA ACCTCAGGAG AGTATGCAAT AGAAGCGTGG 4 20 

55 ACAAAGTTTG ATTTTCCTGG AAGAGGAAAT AACCATTCCA GCTTTAAGTG GCGCTGGTAT 4 80 

CATTTTGATG GGACAGATTG GGATCAGTCA CGCCAGCTTC AAAACAAAAT ATATAAATTC 54 0 
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AGGGGAACAG GCAAGC i GGACTGGGAA GTCGATACAG AGAATGGCAA CTATGACTAT 600 

CTTATGTATG CAGACGTGGA TATGGATCAC CCAGAAGTAA TACATGAACT TAGAAACTGG 660 

5 ' GGAGTGTGGT ATACGAATAC ACTGAACCTT GATGGATTTA GAATAGATGC AGTGAAACAT 720 

ATAAAATATA GCTTTACGAG AGATTGGCTT ACACATGTGC GTAACACCAC AGGTAAACCA 780 

ATGTTTGCAG TGGCTGAGTT TTGGAAAAAT GACCTTGGTG CAATTGAAAA CTATTTGAAT 84 0 

10 

AAAACAAGTT GGAATCACTC GGTGTTTGAT GTTCCTCTCC ACTATAATTT GTACAATGCA 900 

TCTAATAGCG GTGGTTATTA TGATATGAGA AATATTTTAA ATGGTTCTGT GGTGCAAAAA 960 

15 CATCCAACAC ATGCCGTTAC TTTTGTTGAT AACCATGATT CTCAGCCCGG GGAAGCATTG 1020 

GAATCCTTTG TTCAACAATG GTTTAAACCA CTTGCATATG CATTGGTTCT GACAAGGGAA 1080 

CAAGGTTATC CTTCCGTATT TTATGGGGAT TACTACGGTA TCCCAACCCA TGGTGTTCCG 114 0 

20 

GCTATGAAAT CTAAAATAGA CCCTCTTCTG CAGGCACGTC AAACTTTTGC CTATGGTACG 1200 

CAGCATGATT ACTTTGATCA TCATGATATT ATCGGTTGGA CAAGAGAGGG AAATAGCTCC 12 60 

25 CATCCAAATT CAGGCCTTGC CACCATTATG TCAGATGGTC CAGGTGGTAA CAAATGGATG 1320 

TATGTGGGGA AAAATAAAGC GGGACAAGTT TGGAGAGATA TTACCGGAAA TAGGACAGGC 138 0 

ACCGTCACAA TTAATGCAGA CGGATGGGGT AATTTCTCTG TTAATGGAGG GTCCGTTTCG 14 4 0 

30 

GTTTGGGTGA AGCAA 14 55 

(2) INFORMATION FOR SEQ ID NO: 10: 
<i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1455 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CATCATAATG GGACAAATGG GACGATGATG CAATACTTTG AATGGCACTT GCCTAATGAT 60 

GGGAATCACT GGAATAGATT AAGAGATGAT GCTAGTAATC TAAGAAATAG AGGTATAACC 120 

45 

GCTATTTGGA TTCCGCCTGC CTGGAAAGGG ACTTCGCAAA ATGATGTGGG GTATGGAGCC 180 

TATGATCTTT ATGATTTAGG GGAATTTAAT CAAAAGGGGA CGGTTCGTAC TAAGTATGGG 2 40 

50 ACACGTAGTC AATTGGAGTC TGCCATCCAT GCTTTAAAGA ATAATGGCGT TCAAGTTTAT 300 

GGGGATGTAG TGATGAACCA TAAAGGAGGA GCTGATGCTA CAGAAAACGT TCTTGCTGTC 360 

GAGGTGAATC CAAATAACCG GAATCAAGAA ATATCTGGGG ACTACACAAT TGAGGCTTGG 4 20 

55 

ACTAAGTTTG ATTTTCCAGG GAGGGGTAAT ACATACTCAG ACTTTAAATG GCGTTGGTAT 4 80 

CATTTCGATG GTGTAGATTG GGATCAATCA CGACAATTCC AAAATCGTAT CTACAAATTC 54 0 
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45 



55 



CGAGGTGATG 


GTAAGGCATG 


GGATTGGGAA 


GTAGATTCGG 


AAAATGGAAA 


TTATGATTAT 


600 


TTAATGTATG 


CAGATGTAGA 


TATGGATCAT 


CCGGAGGTAG 


TAAATGAGCT 


TAGAAGATGG 


660 


GGAGAATGGT 


ATACAAATAC 


ATTAAATCTT 


GATGGATTTA 


GGATCGATGC 


GGTGAAGCAT 


720 


ATTAAATATA 


GCTTTACACG 


TGATTGGTTG 


ACCCATGTAA 


GAAACGCAAC 


GGGAAAAGAA 


780 


ATGTTTGCTG 


TTGCTGAATT 


TTGGAAAAAT 


GATTTAGGTG 


CCTTGGAGAA 


CTATTTAAAT 


840 


AAAACAAACT 


GGAATCATTC 


TGTCTTTGAT 


GTCCCCCTTC 


ATTATAATCT 


TTATAACGCG 


900 


TCAAATAGTG 


GAGGCAACTA 


TGACATGGCA 


AAACTTCTTA 


ATGGAACGGT 


TGTTCAAAAG 


960 


CATCCAATGC 


ATGCCGTAAC 


TTTTGTGGAT 


AATCACGATT 


CTCAACCTGG 


GGAATCATTA 


1020 


GAATCATTTG 


TACAAGAATG 


GTTTAAGCCA 


CTTGCTTATG 


CGCTTATTTT 


AACAAGAGAA 


1080 


CAAGGCTATC 


CCTCTGTCTT 


CTATGGTGAC 


TACTATGGAA 


TTCCAACACA 


TAGTGTCCCA 


1140 


GCAATGAAAG 


CCAAGATTGA 


TCCAATCTTA 


GAGGCGCGTC 


AAAATTTTGC 


ATATGGAACA 


1200 


CAACATGATT 


ATTTTGACCA 


TCATAATATA 


ATCGGATGGA 


CACGTGAAGG 


AAATACCACG 


1260 


CATCCCAATT 


CAGGACTTGC 


GACTATCATG 


TCGGATGGGC 


CAGGGGGAGA 


GAAATGGATG 


1320 


TACGTAGGGC 


AAAATAAAGC 


AGGTCAAGTT 


TGGCATGACA 


TAACTGGAAA 


TAAACCAGGA 


1380 


ACAGTTACGA 


TCAATGCAGA 


TGGATGGGCT 


AATTTTTCAG 


TAAATGGAGG 


ATCTGTTTCC 


1440 


ATTTGGGTGA 


AACGA 










1455 



10 



15 



20 



25 



30 



35 (2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 



GCCGCACCGT TTAACGGCAC CATGATGCAG TATTTTGAAT GGTACTTGCC GGATGATGGC 60 

ACGTTATGGA CCAAAGTGGC CAATGAAGCC AACAACTTAT CCAGCCTTGG CATCACCGCT 120 

CTTTGGCTGC CGCCCGCTTA CAAAGGAACA AGCCGCAGCG ACGTAGGGTA CGGAGTATAC 18 0 

50 GACTTGTATG ACCTCGGCGA ATTCAATCAA AAAGGGACCG TCCGCACAAA ATACGGAACA 2 40 

AAAGCTCAAT ATCTTCAAGC CATTCAAGCC GCCCACGCCG- CTGGAATGCA AGTGTACGCC 300 

GATGTCGTGT TCGACCATAA AGGCGGCGCT GACGGCACGG AATGGGTGGA CGCCGTCGAA 3 60 

GTCAATCCGT CCGACCGCAA CCAAGAAATC TCGGGCACCT ATCAAATCCA AGCATGGACG 4 20 

AAATTTGATT TTCCCGGGCG GGGCAACACC TACTCCAGCT TTAAGTGGCG CTGGTACCAT 4 80 
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TTTGACGGCG TTGATTGGGA CGAAAGCCGA AAATTGAGCC GCATTTACAA ATTCCGCGGC 54 0 

ATCGGCAAAG CGTGGGATTG GGAAGTAGAC ACGGAAAACG GAAACTATGA CTACTTAATG 600 

TATGCCGACC TTGATATGGA TCATCCCGAA GTCGTGACCG AGCTGAAAAA CTGGGGGAAA 6 60 

TGGTATGTCA ACACAACGAA CATTGATGGG TTCCGGCTTG ATGCCGTCAA GCATATTAAG 720 

TTCAGTTTTT TTCCTGATTG GTTGTCGTAT GTGCGTTCTC AGACTGGCAA GCCGCTATTT 7 80 

ACCGTCGGGG AATATTGGAG CTATGACATC AACAAGTTGC ACAATTACAT TACGAAAACA 840 

GACGGAACGA TGTCTTTGTT TGATGCCCCG TTACACAACA AATTTTATAC CGCTTCCAAA 900 

TCAGGGGGCG CATTTGATAT GCGCACGTTA ATGACCAATA CTCTCATGAA AGATCAACCG 960 

ACATTGGCCG TCACCTTCGT TGATAATCAT GACACCGAAC CCGGCCAAGC GCTGCAGTCA 1020 

TGGGTCGACC CATGGTTCAA ACCGTTGGCT TACGCCTTTA TTCTAACTCG GCAGGAAGGA 1080 

TACCCGTGCG TCTTTTATGG TGACTATTAT GGCATTCCAC AATATAACAT TCCTTCGCTG 114 0 

AAAAGCAAAA TCGATCCGCT CCTCATCGCG CGCAGGGATT ATGCTTACGG AACGCAACAT 1200 

GATTATCTTG ATCACTCCGA CATCATCGGG TGGACAAGGG AAGGGGGCAC TGAAAAACCA 12 60 

GGATCCGGAC TGGCCGCACT GATCACCGAT GGGCCGGGAG GAAGCAAATG GATGTACGTT 1320 

GGCAAACAAC ACGCTGGAAA AGTGTTCTAT GACCTTACCG GCAACCGGAG TGACACCGTC 1380 

ACCATCAACA GTGATGGATG GGGGGAATTC AAAGTCAATG GCGGTTCGGT TTCGGTTTGG 14 4 0 

GTTCCTAGAA AAACGACCGT TTCTACCATC GCTCGGCCGA TCACAACCCG ACCGTGGACT 1500 

GGTGAATTCG TCCGTTGGAC CGAACCACGG TTGGTGGCAT GGCCTTGA 154 8 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 421. .1872 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CGGAAGATTG GAAGTACAAA AATAAGCAAA AGATTGTCAA TCATGTCATG AGCCATGCGG 60 

GAGACGGAAA AATCGTCTTA ATGCACGATA TTTATGCAAC GTTCGCAGAT GCTGCTGAAG 120 

AGATTATTAA AAAGCTGAAA GCAAAAGGCT ATCAATTGGT AACTGTATCT CAGCTTGAAG 180 

AAGTGAAGAA GCAGAGAGGC TATTGAATAA ATGAGTAGAA GCGCCATATC GGCGCTTTTC 24 0 
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TTTTGGAAGA AAATATAGGG AAAATGGTAC TTGTTAAAAA TTCGGAATAT TTATACAACA 300 
TCATATGTTT CACATTGAAA GGGGAGGAGA ATCATGAAAC AACAAAAACG GCTTTACGCC 360 
CGATTGCTGA CGCTGTTATT TGCGCTCATC TTCTTGCTGC CTCATTCTGC AGCAGCGGCG 4 20 

GCA AAT CTT AAT GGG ACG CTG ATG CAG TAT TTT GAA TGG TAC ATG CCC 4 68 

AAT GAC GGC CAA CAT TGG AGG CGT TTG CAA AAC GAC TCG GCA TAT TTG 516 
GCT GAA CAC GGT ATT ACT GCC GTC TGG ATT CCC CCG GCA TAT AAG GGA 564 
ACG AGC CAA GCG GAT GTG GGC TAC GGT GCT TAC GAC CTT TAT GAT TTA 612 
15 GGG GAG TTT CAT CAA AAA GGG ACG GTT CGG ACA AAG TAC GGC ACA AAA 660 
GGA GAG CTG CAA TCT GCG ATC AAA AGT CTT CAT TCC CGC GAC ATT AAC 7 08 

GTT TAC GGG GAT GTG GTC ATC AAC CAC AAA GGC GGC GCT GAT GCG ACC 75 6 

20 

GAA GAT GTA ACC GCG GTT GAA GTC GAT CCC GCT GAC CGC AAC CGC GTA 804 

ATT TCA GGA GAA CAC CTA ATT AAA GCC TGG ACA CAT TTT CAT TTT CCG 852 

25 GGG CGC GGC AGC ACA TAC AGC GAT TTT AAA TGG CAT TGG TAC CAT TTT 900 

GAC GGA ACC GAT TGG GAC GAG TCC CGA AAG CTG AAC CGC ATC TAT AAG 94 8 

TTT CAA GGA AAG GCT TGG GAT TGG GAA GTT TCC AAT GAA AAC GGC AAC 996 
30 S 

TAT GAT TAT TTG ATG TAT GCC GAC ATC GAT TAT GAC CAT CCT GAT GTC 104 4 

GCA GCA GAA ATT AAG AG A TGG GGC ACT TGG TAT GCC AAT GAA CTG CAA 1092 

35 TTG GAC GGT TTC CGT CTT GAT GCT GTC AAA CAC ATT AAA TTT TCT TTT 1140 

TTG CGG GAT TGG GTT AAT CAT GTC AGG GAA AAA ACG GGG AAG GAA ATG 1188 

TTT ACG GTA GCT GAA TAT TGG CAG AAT GAC TTG GGC GCG CTG GAA AAC 1236 

40 

TAT TTG AAC AAA ACA AAT TTT AAT CAT TCA GTG TTT GAC GTG CCG CTT 1284 

CAT TAT CAG TTC CAT GCT GCA TCG ACA CAG GGA GGC GGC TAT GAT ATG 1332 

45 AGG AAA TTG CTG AAC GGT ACG GTC GTT TCC AAG CAT CCG TTG AAA TCG 138 0 

GTT ACA TTT GTC GAT AAC CAT GAT ACA CAG CCG GGG CAA TCG CTT GAG 14 28 

TCG ACT GTC CAA ACA TGG TTT AAG CCG CTT GCT TAC GCT TTT ATT CTC 14 7 6 

50 

ACA AGG GAA TCT GGA TAC CCT CAG GTT TTC TAC GGG GAT ATG TAC GGG 1524 

ACG AAA GGA GAC TCC CAG CGC GAA ATT CCT GCC TTG AAA CAC AAA ATT 1572 

55 GAA CCG ATC TTA AAA GCG AGA AAA CAG TAT GCG TAC GGA GCA CAG CAT 1620 

GAT TAT TTC GAC CAC CAT GAC ATT GTC GGC TGG ACA AGG GAA GGC GAC 1668 
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AGC TCG GTT GCA AAT TCA GGT TTG GCG GCA TTA ATA ACA GAC GGA CCC 1716 

GGT GGG GCA AAG CGA ATG TAT GTC GGC CGG CAA AAC GCC GGT GAG ACA 17 64 

TGG CAT GAC ATT ACC GGA AAC CGT TCG GAG CCG GTT GTC ATC AAT TCG 1812 

GAA GGC TGG GGA GAG TTT CAC GTA AAC GGC GGG TCG GTT TCA ATT TAT 18 60 

GTT CAA AGA TAG AAGAGCAGAG AGGACGGATT TCCTGAAGGA AATCCGTTTT 1912 
TTTATTTT 1920 



(2) INFORMATION FOR SEQ ID NO: 12: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2084 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 343. . 1794 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

25 

GCCCCGCACA TACGAAAAGA CTGGCTGAAA ACATTGAGCC TTTGATGACT GATGATTTGG 60 
CTGAAGAAGT GGATCGATTG TTTGAGAAAA G AAG AAG ACC ATAAAAATAC CTTGTCTGTC 120 
30 ATCAGACAGG GTATTTTTTA TGCTGTCCAG ACTGTCCGCT GTGTAAAAAT AAGGAATAAA 180 
GGGGGGTTGT TATTATTTTA CTGATATGTA AAATATAATT TGTATAAGAA AATGAGAGGG 24 0 

AGAGGAAACA TGATTCAAAA ACGAAAGCGG ACAGTTTCGT TCAGACTTGT GCTTATGTGC 300 

35 

ACGCTGTTAT TTGTCAGTTT GCCGATTACA AAAACATCAG CC GTA AAT GGC ACG 354 
CTG ATG CAG TAT TTT GAA TGG TAT ACG CCG AAC GAC GGC CAG CAT TGG 4 02 

40 AAA CGA TTG CAG AAT GAT GCG GAA CAT TTA TCG GAT ATC GGA ATC ACT 4 50 

GCC GTC TGG ATT CCT CCC GCA TAG AAA GGA TTG AGC CAA TCC GAT AAC 4 98 

GGA TAC GGA CCT TAT GAT TTG TAT GAT TTA GGA GAA TTC CAG CAA AAA 54 6 

45 

GGG ACG GTC AGA ACG AAA TAC GGC ACA AAA TCA GAG CTT CAA GAT GCG 5 94 

ATC GGC TCA CTG CAT TCC CGG AAC GTC CAA GTA TAC GGA GAT GTG GTT 64 2 

50 TTG AAT CAT AAG GCT GGT GCT GAT GCA ACA GAA GAT GTA ACT GCC GTC 690 
GAA GTC AAT CCG GCC AAT AGA AAT CAG GAA ACT TCG GAG GAA TAT CAA 7 38 

ATC AAA GCG TGG ACG GAT TTT CGT TTT CCG GGC CGT GGA AAC ACG TAC 78 6 

AGT GAT TTT AAA TGG CAT TGG TAT CAT TTC GAC GGA GCG GAC TGG GAT 8 34 

GAA TCC CGG AAG ATC AGC CGC ATC TTT AAG TTT CGT GGG GAA GGA AAA 882 
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GCG TGG GAT TGG GAA GTA TCA AGT GAA AAC GGC AAC TAT GAC TAT TTA 930 
ATG TAT GCT GAT GTT GAC TAC GAC CAC CCT GAT GTC GTG GCA GAG ACA 978 

AAA AAA TGG GGT ATC TGG TAT GCG AAT GAA CTG TCA TTA GAC GGC TTC 1026 

CGT ATT GAT GCC GCC AAA CAT ATT AAA TTT TCA TTT CTG CGT GAT TGG 107 4 

GTT CAG GCG GTC AGA CAG GCG ACG GGA AAA GAA ATG TTT ACG GTT GCG 1122 

GAG TAT TGG CAG AAT AAT GCC GGG AAA CTC GAA AAC TAC TTG AAT AAA 1170 

ACA AGC TTT AAT CAA TCC GTG TTT GAT GTT CCG CTT CAT TTC AAT TTA 1218 

CAG GCG GCT TCC TCA CAA GGA GGC GGA TAT GAT ATG AGG CGT TTG CTG 1266 

GAC GGT ACC GTT GTG TCC AGG CAT CCG GAA AAG GCG GTT ACA TTT GTT 1314 

GAA AAT CAT GAC ACA CAG CCG GGA CAG TCA TTG GAA TCG ACA GTC CAA 13 62 

ACT TGG TTT AAA CCG CTT GCA TAC GCC TTT ATT TTG ACA AGA GAA TCC 1410 

GGT TAT CCT CAG GTG TTC TAT GGG GAT ATG TAC GGG ACA AAA GGG ACA 14 58 

TCG CCA AAG GAA ATT CCC TCA CTG AAA GAT AAT ATA GAG CCG ATT TTA 1506 

AAA GCG CGT AAG GAG TAC GCA TAC GGG CCC CAG CAC GAT TAT ATT GAC 1554 

CAC CCG GAT GTG ATC GGA TGG ACG AGG GAA GGT GAC AGC TCC GCC GCC 1602 

AAA TCA GGT TTG GCC GCT TTA ATC ACG GAC GGA CCC GGC GGA TCA AAG 1650 

CGG ATG TAT GCC GGC CTG AAA AAT GCC GGC GAG ACA TGG TAT GAC ATA 1698 

ACG GGC AAC CGT TCA GAT ACT GTA AAA ATC GGA TCT GAC GGC TGG GGA 174 6 

GAG TTT CAT GTA AAC GAT GGG TCC GTC TCC ATT TAT GTT CAG AAA TAA 17 94 

GGTAATAAAA AAACACCTCC AAGCTGAGTG CGGGTATCAG CTTGGAGGTG CGTTTATTTT 1854 

TTCAGCCGTA TGACAAGGTC GGCATCAGGT GTGACAAATA CGGTATGCTG GCTGTCATAG 1914 

GTGACAAATC CGGGTTTTGC GCCGTTTGGC TTTTTCACAT GTCTGATTTT TGTATAATCA 197 4 

ACAGGCACGG AGCCGGAATC TTTCGCCTTG GAAAAATAAG CGGCGATCGT AGCTGCTTCC 2034 

AATATGGATT GTTCATCGGG ATCGCTGCTT TTAATCACAA CGTGGGATCC 2084 



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
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CATCATAATG GAACAAATGG TACTATGATG CAATATTTCG AATGGTATTT GCCAAATGAC 60 

GGGAATCATT GGAACAGGTT GAGGGATGAC GCAGCTAACT TAAAGAGTAA AGGGATAACA 12 0 

5 

GCTGTATGGA TCCCACCTGC ATGGAAGGGG ACTTCCCAGA ATGATGTAGG TTATGGAGCC 180 

TATGATTTAT ATGATCTTGG AGAGTTTAAC CAGAAGGGGA CGGTTCGTAC AAAATATGGA 24 0 

10 ACACGCAACC AGCTACAGGC TGCGGTGACC TCTTTAAAAA ATAACGGCAT TCAGGTATAT 300 

GGTGATGTCG TCATGAATCA TAAAGGTGGA GCAGATGGTA CGGAAATTGT AAATGCGGTA 3 60 

GAAGTGAATC GGAGCAACCG AAACCAGGAA ACCTCAGGAG AGTATGCAAT AGAAGCGTGG 4 20 

15 

ACAAAGTTTG ATTTTCCTGG AAGAGGAAAT AACCATTCCA GCTTTAAGTG GCGCTGGTAT 4 80 

CATTTTGATG GGACAGATTG GGATCAGTCA CGCCAGCTTC AAAACAAAAT ATATAAATTC 54 0 

20 AGGGGAACAG GCAAGGCCTG GGACTGGGAA GTCGATACAG AGAATGGCAA CTATGACTAT 600 

CTTATGTATG CAGACGTGGA TATGGATCAC CCAGAAGTAA TACATGAACT TAGAAACTGG 660 

GGAGTGTGGT ATACGAATAC ACTGAACCTT GATGGATTTA GAATAGATGC AGTGAAACAT 7 20 

25 

ATAAAATATA GCTTTACGAG AGATTGGCTT ACACATGTGC GTAACACCAC AGGTAAACCA 7 80 

ATGTTTGCAG TGGCTGAGTT TTGGAAAAAT GACCTTGGTG CAATTGAAAA GTATTTGAAT 84 0 

30 AAAACAAGTT GGAATCACTC GGTGTTTGAT GTTCCTCTCC ACTATAATTT GTACAATGCA 900 

TCTAATAGCG GTGGTTATTA T GAT AT GAGA AATATTTTAA ATGGTTCTGT GGTGCAAAAA 960 

CATCCAACAC ATGCCGTTAC TTTTGTTGAT AACCATGATT CTCAGCCCGG GGAAGCATTG 1020 
35 • % 

GAATCCTTTG TTCAACAATG GTTTAAACCA CTTGCATATG CATTGGTTCT GACAAGGGAA 1080 

CAAGGTTATC CTTCCGTATT TTATGGGGAT TACTACGGTA TCCCAACCCA TGGTGTTCCG 114 0 

40 GCTATGAAAT CTAAAATAGA CCCTCTTCTG CAGGCACGTC AAACTTTTGC CTATGGTACG 1200 

CAGCATGATT ACTTTGATCA TCATGATATT ATCGGTTGGA CAAGAGAGGG AAATAGCTCC 12 60 

CATCCAAATT CAGGCCTTGC CACCATTATG TCAGATGGTC CAGGTGGTAA CAAATGGATG 1320 

45 

TATGTGGGGA AAAATAAAGC GGGACAAGTT TGGAGAGATA TTACCGGAAA TAGGACAGGC 1380 

ACCGTCACAA TTAATGCAGA CGGATGGGGT AATTTCTCTG TTAATGGAGG GTCCGTTTCG 14 40 

50 GTTTGGGTGA AGCAA 1455 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CATCATAATG GGACAAATGG GACGATGATG CAATACTTTG AATGGCACTT GCCTAATGAT 60 

GGGAATCACT GGAATAGATT AAGAGATGAT GCTAGTAATC TAAGAAATAG AGGTATAACC 120 

GCTATTTGGA TTCCGCCTGC CTGGAAAGGG ACTTCGCAAA ATGATGTGGG GTATGGAGCC 180 

TATGATCTTT ATGATTTAGG GGAATTTAAT CAAAAGGGGA CGGTTCGTAC TAAGTATGGG 24 0 

ACACGTAGTC AATTGGAGTC TGCCATCCAT GCTTTAAAGA ATAATGGCGT TCAAGTTTAT 300 

GGGGATGTAG TGATGAACCA TAAAGGAGGA GCTGATGCTA CAGAAAACGT TCTTGCTGTC 360 

GAGGTGAATC CAAATAACCG GAATCAAGAA ATATCTGGGG ACTACACAAT TGAGGCTTGG 4 20 

ACTAAGTTTG ATTTTCCAGG GAGGGGTAAT ACATACTCAG ACTTTAAATG GCGTTGGTAT 4 80 

CATTTCGATG GTGTAGATTG GGATCAATCA CGACAATTCC AAAATCGTAT CTACAAATTC 54 0 

CGAGGTGATG GTAAGGCATG GGATTGGGAA GTAGATTCGG AAAATGGAAA TTATGATTAT 600 

TTAATGTATG CAGATGTAGA TATGGATCAT CCGGAGGTAG TAAATGAGCT TAGAAGATGG 660 

GGAGAATGGT ATACAAATAC ATTAAATCTT GATGGATTTA GGATCGATGC GGTGAAGCAT 720 

ATTAAATATA GCTTTACACG TGATTGGTTG ACCCATGTAA GAAACGCAAC GGGAAAAGAA 7 80 

ATGTTTGCTG TTGCTGAATT TTGGAAAAAT GATTTAGGTG CCTTGGAGAA CTATTTAAAT 840 

it 

AAAACAAACT GGAATCATTC TGTCTTTGAT GTCCCCCTTC ATTATAATCT TTATAACGCG 900 

TCAAATAGTG GAGGCAACTA TGACATGGCA AAACTTCTTA ATGGAACGGT TGTTCAAAAG 960 

CATCCAATGC ATGCCGTAAC TTTTGTGGAT AATCACGATT CTCAACCTGG GGAATCATTA 1020 

GAATCATTTG TACAAGAATG GTTTAAGCCA CTTGCTTATG CGCTTATTTT AACAAGAGAA 1080 

CAAGGCTATC CCTCTGTCTT CTATGGTGAC TACTATGGAA TTCCAACACA TAGTGTCCCA 114 0 

GCAATGAAAG CCAAGATTGA TCCAATCTTA GAGGCGCGTC AAAATTTTGC ATATGGAACA 1-200 

CAACATGATT ATTTTGACCA TCATAATATA ATCGGATGGA CACGTGAAGG AAATACCACG 1260 

CATCCCAATT CAGGACTTGC GACTATCATG TCGGATGGGC CAGGGGGAGA GAAATGGATG 1320 

TACGTAGGGC AAAATAAAGC AGGTCAAGTT TGGCATGACA TAACTGGAAA TAAACCAGGA 1380 

ACAGTTACGA TCAATGCAGA TGGATGGGCT AATTTTTCAG TAAATGGAGG ATCTGTTTCC 1440 

ATTTGGGTGA AACGA 1455 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY: misc- feature : 
5 (B) OTHER INFORMATION: /desc = "Forward Primer FSA" 

(ix) FEATURE: 

(A) NAME/KEY: misc-feature 
(B) LOCATION: 22-27,2 9,31-33,41 

(D) : OTHER INFORMATION: /Note= 1: 35% A, 65% C 

10 2: 83% G, 17% A 

3: 63% G, 37% T 
4: 86% G, 14% A 
5: 85% G, 15% C 
6: 50% T, 50% C 
15 7: 95% A, 5%G 

8: 58% G, 37% A, 5% T 
9: 86% C, 13% A, 1% G 
10: 83% T, 17% G 
11: 92% G, 8% C 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

caaaatcgta tctacaaatt cl23456a7g 8910tgggatt 
llggaagtaga ttcggaaaat 
60 

25 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-feature: 
35 (B) OTHER INFORMATION: /desc = "Reverse Primer RSA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



40 



55 



gaatttgtag atacgatttt g 
21 



(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
50 (A) NAME /KEY : misc-feature: 

(B) OTHER INFORMATION : /desc = "Primer Bl" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



CGATTGCTGA CGCTGTTATT TGCG 24 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic • acid 
5 (ix) FEATURE: 

(A) NAME/KEY : misc- feature : 

(B) OTHER INFORMATION: /desc = "Primer Y2" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

10 CTTGTTCCCT TGTCAGAACC AATG 2 4 

(2) INFORMATION FOR SEQ ID NO: 19: 
( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 30 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
20 (A) NAME/KEY: misc- feature : 

(B) OTHER INFORMATION: /desc = "Primer 101458" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



25 



GTCATAGTTG CCGAAATCTG TATCGACTTC 30 



(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 
35 (B) OTHER INFORMATION : . /desc = " Prima r 101638" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CCCAGTCCCA CGTACGTCCC CTGAATTTATATA TTTTG 35 

40 (2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc- feature : 

(B) OTHER INFORMATION: /desc - "Oligo 1" 
50 (A) NAME /KEY : misc-feature 

(B) LOCATION: 12 

(D) : OTHER INFORMATION: /Note=N= 25% A, 25% C, 25% G, 25% T. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

55 CCCAGTCCCA GNTCTTTCCC CTGAATTTAT ATATTTTG 38 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 25 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
fix) FEATURE: 

(A) NAME /KEY : misc-f eature : 

<B) OTHER INFORMATION: /desc = "Primer X2" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GCGTGGACAA AGTTTGATTT TCCTG 25 



(2) INFORMATION FOR SEQ ID NO: 23: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 
vB) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY : misc- feature : 

(B) OTHER INFORMATION: /desc = "Primer DA01" 
(xi) SEQUENCE DESCRIPTION: SEQ I D NO : 23: 

25 

2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc- feature : 
35 ~(B) OTHER INFORMATION: /desc - "Primer DA03" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GCATTGGATG CTTTTGAACA ACCG 24 

2) INFORMATION FOR SEQ ID NO: 25: 
40 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc - "Primer DA07" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



50 



CGCAAAATGA TATCGGGTAT GGAGCC 2 6 



(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
5 (A) NAME/ KEY: raise- feature : 

(B) OTHER INFORMATION: /desc = "Primer DA20" 
(ix) FEATURE: 

(A) NAME/KEY: misc-feature 

(B) LOCATION: 13,14 

10 (D) : OTHER INFORMATION: /Note:S= mixture of C and G 

W= mixture of A and T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GTGATGAACC ACSWAGGTGG AGCTGATGC 2 9 

15 (2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY: misc-feature: 

(B) OTHER INFORMATION: /desc = "Primer DA14" 
25 (ix) FEATURE: 

(A) NAME/KEY: misc-feature 

(B) LOCATION: 13,14 

(D): OTHER INFORMATION: /Note:R= mixture of A and G 

Y= mixture of C and T 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GATGGTGTAT. GGRYCAATCA CGACAATTCC 30 

2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
40 (ix) FEATURE: 

(A) NAME /KEY : misc-feature: 

(B) OTHER INFORMATION: /desc = "Primer DA15" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

GGTGTATGGG ATAACTCACG ACAATTCC 28 

45 

2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc-feature: 
55 (B) OTHER INFORMATION: /desc = "Primer DA16" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GGTGTATGGG ATCTCTCACG ACAATTCC 



2) INFORMATION FOR SEQ ID NO: 30: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc - "Primer DA17" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
15 GGGATCAATC ACGAAATTTC CAAAATCGTA TC 32 



2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single ^ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
25 (A) NAME /KEY : misc-f eature : 

(B) OTHER INFORMATION: /desc - "Primer DA18" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GGGATCAATC ACGACTCTTC CAAAATCGTA TC 32 

30 2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY : misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA06" 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

GGAAATTATG ATTATATCAT GTATGCAGAT GTAG 34 ' 



2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
' (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
50 (ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc - "Primer DA09" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GCTGAATTTT GGTCGAATGA TTTAGGTGCC 30 
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2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 30 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
10 (A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DAll" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

GCTGAATTTT GGTCGAATGA TTTAGGTGCC 30 

15 2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION : /desc = "Primer DA21" 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GAATTTTGGA AGTACGATTT AGGTCGG 27 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
35 (ix) FEATURE: 

(A) NAME/ KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc « "Primer DAI 2" 
(ix) FEATURE: 

(A) NAME /KEY : misc-f eature 
40 (B) LOCATION: 12,13 

(D): OTHER INFORMATION: /Note:R= mixture of A and G 

Y= mixture of C and T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



45 



GGAAAAACGA TRYCGGTGCC TTGGAGAAC 2 9 



(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY: misc- feature : 
55 (B) OTHER INFORMATION: /desc - "Primer DAI 3" 
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fix) FEATURE: 

(A) NAME/KEY: mi sc- feature 

(B) LOCATION: 14,15 

(D): OTHER INFORMATION: /No'te:R= mixture of A and G 
5 Y= mixture of C and T 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

GATTTAGGTG CCTRYCAGAA CTATTTA 27 

2) INFORMATION FOR SEQ ID NO: 38: 
10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA08" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
20 CCCCCTTCAT GAGAATCTTT ATAACG 2 6 

2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
30 (A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA04" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

GAATCCGAAC CTCATTACAC ATTCG 25 

35 2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc- feature : 

(B) OTHER INFORMATION: /desc = "Primer DA05" 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CGGATGGACT CGAGAAGGAA ATACCACG 38 

2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
55 (ix) FEATURE: 
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(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION : /desc = "Primer DAI 0 " 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CGTAGGGCAA AATCAGGCCG GTCAAGTTTG G 31 

5 

2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc- feature : 
15 (B) OTHER INFORMATION: /desc = "Primer DA22" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

CATAACTGGA AATCGCCCGG GAACAGTTAC G 31 

(2) INFORMATION FOR SEQ ID NO: 43: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DAI 9" 
(ix) FEATURE: 
30 (A) NAME /KEY : misc-f eature 

(B) LOCATION: 12 

(D): OTHER INFORMATION: /Note:W= mixture of A and T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 3 

CTGGAAATAA AWCCGGAACA GTTACG 3 6 

35 2) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc- feature : 

(B) OTHER INFORMATION: /desc = "Primer DA23" 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GGAAATAAAC CAGGACCCGT TACGATCAAT GC 32 



2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
55 (ix) FEATURE: 
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(A) NAME/KEY: misc-feature : 

(B) OTHER INFORMATION: /desc - "Primer DA32 " 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

GAGGCTTGGA CTAGGTTTGA TTTTCCAG 2 3 



2) INFORMATION FOR SEQ ID NO: 4 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear • 
Ui) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-feature: 
15 (B) OTHER INFORMATION: /desc = "Primer DA31" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GCTGAATTTT GGCGCAATGA TTTAGGTGCC 30 



2) INFORMATION FOR SEQ ID NO: 47: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-feature: 

(B) OTHER INFORMATION: /desc = "Primer bm4" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
30 GTGTTTGACG TCCCGCTTCA TGAGAATTTA CAGG 34 



2) INFORMATION FOR SEQ ID NO: 48: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
35 (B) TYPE: nucleic acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
40 (A) NAME /KEY : misc-feature: 

(B) OTHER INFORMATION: /desc = "Primer bm5 M 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

GTGTTTGACG TCCCGCTTCA TAAGAATTTA CAGG 34 



45 2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc-feature: 

(B) OTHER INFORMATION: /desc = "Primer bm6 M 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GTGTTTGACG TCCCGCTTCA TGCCAATTTA CAGG 3 4 



2) INFORMATION FOR SEQ ID NO: 50: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc- feature : 

(B) OTHER INFORMATION: /desc = "Primer bm8" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
15 AGGGAATCCG GATACCCTGA GGTTTTCTAC GG 32 

2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
20 • (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
25 (A) NAME/KEY: misc- feature : 

(B) OTHER INFORMATION: /desc = "Primer broil' 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GATGTGGTTT TGGATCATAA GGCCGGCGCT GATG 34 



30 2) INFORMATION FOR SEQ ID NO: 52 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer pi" 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

CTGTTATTAA TGCCGCCAAA CC 22 



2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
50 (ix) FEATURE: 

(A) NAME/KEY: misc- feature : 

(B) OTHER INFORMATION: /desc = "Primer p2" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
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GGAAAAGAAA TGTTTACGGT TGCG 2 4 



2) INFORMATION FL 3EQ 10 NO: b'- : 
(i) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
10 (ix) FEATURE: 

(A) NAME /KEY : misc- feature : 

(B) OTHER INFORMATION: /desc » "Primer p3" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

GAAATGAAGC GGAACATCAA ACACG 25 

15 

2) INFORMATION FOR SEQ ID NO: 55: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc- feature : 
25 (B) OTHER INFORMATION: /desc = "Primer p4" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GTATGATTTA GGAGAATTCC 20 
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